DataProfiler

Capital One created and maintains this project. It is based on a paper published in 2020, https://arxiv.org/abs/2012.09597.

Logo

../_images/capitalone_dataprofiler-small.png

Website

https://github.com/capitalone/DataProfiler

Repository

https://github.com/capitalone/DataProfiler

Byline

The DataProfiler is a Python library designed to make data analysis, monitoring and sensitive data detection easy.

License

Apache 2.0

Project age

0 years 8 months

Backers

Capital One (Creator and maintainer)

Lastest News (2021-06-02)

Data Profiler 0.5.0 Major release, unstructured profiles can now be generated. more

Size score (1 to 10, higher is better)

2.0

Trend score (1 to 10, higher is better)

8.25

Education Resources

URL

Resource Type

Description

https://capitalone.github.io/DataProfiler/docs/0.5.3/html/install.html

Documentation

Official project documentation.

Git Commit Statistics

Statistics computed using Git data through June 30, 2021.

Statistic

Lifetime

Last 12 Months

Commits

245

245

Lines committed

371,190

371,190

Unique committers

15

15

Core committers

4

4

../_images/capitalone_dataprofiler-monthly-commits.png

Similar Projects

Project

Size Score

Trend Score

Byline

DataCleaner

3.75

2.0

DataCleaner is a Data Quality toolkit that allows you to profile, correct and enrich your data.

OpenRefine

6.5

8.0

OpenRefine is a free, open source power tool for working with messy data and improving it