Data Profiler

Capital One created and maintains this project. It is based on a paper published in 2020, https://arxiv.org/abs/2012.09597.

Logo

../_images/capitalone_dataprofiler-small.png

Website

https://github.com/capitalone/DataProfiler

Repository

https://github.com/capitalone/DataProfiler

Byline

The DataProfiler is a Python library designed to make data analysis, monitoring and sensitive data detection easy.

License

Apache 2.0

Project age

2 years 1 months

Backers

Capital One (Creator and maintainer)

Lastest News (2022-09-20)

0.8.0 Several minor enhancements and bug fixes. See the release notes for details. more

Size score (1 to 10, higher is better)

3.0

Trend score (1 to 10, higher is better)

5.0

Education Resources

URL

Resource Type

Description

https://capitalone.github.io/DataProfiler/docs/0.5.3/html/install.html

Documentation

Official project documentation.

Git Commit Statistics

Statistics computed using Git data through November 30, 2022.

Statistic

Lifetime

Last 12 Months

Commits

8,627

992

Lines committed

9,152,866

274,016

Unique committers

34

20

Core committers

6

8

../_images/capitalone_dataprofiler-monthly-commits.png

Similar Projects

Project

Size Score

Trend Score

Byline

DataCleaner

4.25

5.0

DataCleaner is a Data Quality toolkit that allows you to profile, correct and enrich your data.

OpenRefine

7.25

8.0

OpenRefine is a free, open source power tool for working with messy data and improving it