Profiling-for-Hive

This project is for ''Visualized profiling large-scale query execution''.

Current version supports profiling (SQL) query execution at task-level on Hive.

The updated version will be relased soon.

Usage

This is for extracting useful info. from log for profiling.

Usage: Step1: obtain yarn log after Hive execution (e.g., "ds100_query12");

Step2: linux shell "cat ds100_query4.log| grep -E "Fang: Tez|Container: container" > tmp.log" then we obtain a log that contains all the statistics info. useful for profiling.

Step3: g++ --std=c++11 main.cpp -o extracter then we obtain executable file "a.out".

Step4: ./extracter then we obtain the statistics file "Profile.csv".

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Profiling		Profiling
profile_extracter/profile_extracter		profile_extracter/profile_extracter
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Profiling-for-Hive

Usage

About

Releases

Packages

Languages

Eilowangfang/Profiling-for-Hive

Folders and files

Latest commit

History

Repository files navigation

Profiling-for-Hive

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages