

The development of pandas-profiling relies completely on contributions.

Monitoring time series? I'd like to draw your attention to popmon. Whereas pandas-profiling allows you to explore patterns in a single dataset, popmon allows you to uncover temporal patterns.

Spark backend in progress: We can happily announce that we're nearing v1 for the Spark backend for generating profile reports. Beta testers wanted! The Spark backend will be released as a pre-release for this package.

File and Image analysis: extract file sizes, creation dates and dimensions, and scan for truncated images or those containing EXIF information.
Text analysis: learn about categories (Uppercase, Space), scripts (Latin, Cyrillic) and blocks (ASCII) of text data.
Missing values: matrix, count, heatmap and dendrogram of missing values.
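To make the missing-values items concrete, here is a minimal sketch in plain pandas of the quantities such a view summarises. It is not pandas-profiling's own implementation: the example data is invented, and treating the heatmap as a correlation of missingness between columns is an assumption made for illustration.

```python
import numpy as np
import pandas as pd

# Invented example data with gaps in two of three columns.
df = pd.DataFrame(
    {
        "a": [1.0, np.nan, 3.0, np.nan],
        "b": [np.nan, "x", "y", np.nan],
        "c": [10, 20, 30, 40],
    }
)

# Missing-values matrix: True wherever a cell is missing.
missing_mask = df.isna()

# Missing-values count per column.
missing_count = missing_mask.sum()

# Assumed basis for a heatmap: correlation of missingness between columns.
# Columns with no missing values (or only missing values) have zero variance
# in the mask, so they are left out before correlating.
partial = missing_mask.loc[:, missing_mask.any() & ~missing_mask.all()]
nullity_corr = partial.astype(int).corr()

print(missing_count)
print(nullity_corr)
```

A value near 1 in `nullity_corr` would mean two columns tend to be missing in the same rows; a value near 0 means their gaps are unrelated.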


Generates profile reports from a pandas DataFrame. The pandas df.describe() function is great but a little basic for serious exploratory data analysis. pandas_profiling extends the pandas DataFrame with df.profile_report() for quick data analysis.

For each column, the statistics relevant to its type are presented in an interactive HTML report.
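As a rough illustration of the difference, the sketch below runs df.describe() on a small invented dataset and wraps the report generation in a helper. The helper name `write_profile`, the report title and the output filename are hypothetical; the helper only works where pandas_profiling is installed, so it is defined here but not called.

```python
import pandas as pd

# Invented example data: one numeric and one text column, each with a gap.
df = pd.DataFrame(
    {
        "age": [23, 35, 41, None, 29],
        "city": ["Oslo", "Lima", "Oslo", "Pune", None],
    }
)

# By default, describe() summarises only the numeric columns.
summary = df.describe()
print(summary)


def write_profile(frame: pd.DataFrame, path: str) -> None:
    """Hypothetical helper: write an HTML profile report for `frame`.

    Requires pandas_profiling; importing it registers
    DataFrame.profile_report().
    """
    import pandas_profiling  # noqa: F401

    report = frame.profile_report(title="Example profile")
    report.to_file(path)
```

With the package installed, `write_profile(df, "example_profile.html")` would produce the interactive report, covering the text column that describe() skips by default.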
