Data Profiler¶
Capital One created and maintains this project. It is based on a paper published in 2020, https://arxiv.org/abs/2012.09597.
Logo  | 
 
 | 
|---|---|
Website  | 
|
Repository  | 
|
Byline  | 
The DataProfiler is a Python library designed to make data analysis, monitoring and sensitive data detection easy.  | 
License  | 
Apache 2.0  | 
Project age  | 
2 years 1 months  | 
Backers  | 
|
Lastest News (2022-09-20)  | 
0.8.0 Several minor enhancements and bug fixes. See the release notes for details. more  | 
Size score (1 to 10, higher is better)  | 
3.0  | 
Trend score (1 to 10, higher is better)  | 
5.0  | 
Education Resources¶
URL  | 
Resource Type  | 
Description  | 
|---|---|---|
https://capitalone.github.io/DataProfiler/docs/0.5.3/html/install.html  | 
Documentation  | 
Official project documentation.  | 
Git Commit Statistics¶
Statistics computed using Git data through November 30, 2022.
Statistic  | 
Lifetime  | 
Last 12 Months  | 
|---|---|---|
Commits  | 
8,627  | 
992  | 
Lines committed  | 
9,152,866  | 
274,016  | 
Unique committers  | 
34  | 
20  | 
Core committers  | 
6  | 
8  | 
Similar Projects¶
Project  | 
Size Score  | 
Trend Score  | 
Byline  | 
|---|---|---|---|
4.25  | 
5.0  | 
DataCleaner is a Data Quality toolkit that allows you to profile, correct and enrich your data.  | 
|
7.25  | 
8.0  | 
OpenRefine is a free, open source power tool for working with messy data and improving it  | 
