Commit

Fix profiler
argenisleon committed Dec 6, 2019
1 parent 2dbf018 commit c44bc33
Showing 6 changed files with 5,875 additions and 604 deletions.
2 changes: 1 addition & 1 deletion optimus/profiler/profiler.py
@@ -396,7 +396,7 @@ def columns_stats(self, df, columns, buckets=10, infer=False, relative_error=REL
logger.print("Processing Stats For columns...")

# Get columns data types. This is necessary to make the pertinent histogram calculations.
-count_by_data_type = df.cols.count_by_dtypes(columns, infer=infer, mismatch=mismatch)
+count_by_data_type = df.cols.count_by_dtypes(columns, infer=infer)

count_by_data_type_no_mismatch = copy.deepcopy(count_by_data_type)

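The commit does not say why `mismatch` was dropped from this call. One possible reading, offered only as an assumption, is that `count_by_dtypes` did not accept that keyword, in which case the old call would fail before any statistics were computed. The sketch below uses a hypothetical stand-in function (not the real Optimus implementation) to show how Python reacts to an unexpected keyword argument.

```python
# Hypothetical stand-in for df.cols.count_by_dtypes; NOT the real Optimus code.
# Assumption: the callee accepts `infer` but has no `mismatch` parameter.
def count_by_dtypes(columns, infer=False):
    """Pretend to count values per data type for each column."""
    return {col: {"infer": infer} for col in columns}


def call_with_mismatch():
    """Mimic the old call site, which passed an extra `mismatch` keyword."""
    try:
        count_by_dtypes(["height(ft)"], infer=False, mismatch=None)
    except TypeError as exc:
        # Python rejects keywords the function signature does not declare.
        return str(exc)
    return None


# The old-style call fails with a TypeError message naming the keyword.
old_call_error = call_with_mismatch()

# The new-style call, without `mismatch`, succeeds.
new_call_result = count_by_dtypes(["height(ft)"], infer=False)
```

If this reading is right, removing the argument at the call site is the smallest fix; the alternative would be to add a `mismatch` parameter to the callee.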
3,125 changes: 3,116 additions & 9 deletions tests/creator/creator-profiler.ipynb

Large diffs are not rendered by default.

6 changes: 6 additions & 0 deletions tests/creator/creator-profiler.py
@@ -151,4 +151,10 @@ def func(col_name, attrs):
import json
json.dumps("{'name'=a'a}")

from optimus.profiler.profiler import Profiler

op.profiler.run(source_df, "*")

source_df.cols.range("height(ft)")

