91/ 100 · A
A top-tier open source project. Docs, tests, and CI are all in excellent shape.
Always know what to expect from your data.
Python11,556 starsApache-2.0updated 4d ago
DocumentationREADME, setup, examples, license
EngineeringTests, CI, linting, lockfiles
Project healthDescription, activity, stars, deps
What to fix first
The highest-impact improvements for this repo.
- 1CI/CDEngineeringInfo
Add a lint step (e.g. `npm run lint`, `ruff check .`, `cargo clippy`) to catch style issues automatically.
- 2CI/CDEngineeringInfo
Add `tsc --noEmit`, `mypy`, or `cargo check` to catch type errors before they merge.
- 3ReproducibilityEngineeringInfo
Add .github/dependabot.yml with at least one package-ecosystem entry so dependencies are updated automatically.
Detailed breakdown
Documentation
95- README100
- README is present.
- README is well structured with multiple sections.
- README includes screenshots or visuals. Great for first impressions.
- README has code examples.
- README links to a live demo or deployed app.
- README includes status badges.
- Install and run instructions100
- README documents how to install the project.
- README documents how to run the project.
- .env.example is present. Contributors can see exactly which env vars to set.
- License100
- Licensed under Apache-2.0.
- Contributing guide70
- CONTRIBUTING guide or docs directory present.
- Optional: add a Code of Conduct (+5 pts).A CODE_OF_CONDUCT.md signals that your project is welcoming. GitHub has a template you can add in one click.
Engineering
92- Tests100
- Test files detected (contrib/capitalone_dataprofiler_expectations/capitalone_dataprofiler_expectations/tests).
- Pytest is fully configured in pyproject.toml with testpaths and test files detected.
- Coverage reporting is configured in pyproject.toml.
- CI/CD85
Not applicable?
- CI is configured (.github/workflows/ci.yml).
- CI workflow runs tests.
- CI runs on pull requests, not just on pushes to main.
- CI does not appear to run a linter (−15 pts).Add a lint step (e.g. `npm run lint`, `ruff check .`, `cargo clippy`) to catch style issues automatically.
- Optional: add type checking to CI.Add `tsc --noEmit`, `mypy`, or `cargo check` to catch type errors before they merge.
- CI reports or uploads test coverage.
- CI caches dependencies for faster runs.
- CI tests across multiple environments or versions.
- Linting and formatting100
- Linter or formatter configured (contrib/ruff.toml).
- pyproject.toml configures both a formatter/linter (ruff/black) and type checking (mypy).
- Reproducibility80
- Lockfile present (requirements.txt). Installs are reproducible.
- Environment pinned via assets/docker/clickhouse/docker-compose.yml.
- No Dependabot config (adding it earns up to +20 pts).Add .github/dependabot.yml with at least one package-ecosystem entry so dependencies are updated automatically.
- Issue and PR templates90
- Issue or PR templates present.
- Optional: add a SECURITY.md.A SECURITY.md explains how to responsibly disclose vulnerabilities. Worth adding once the project has real users.
Project health
81- Dependency manifest65
- Dependency manifest found (pyproject.toml).
- pyproject.toml has a [build-system] table. The package can be built and published.
- Repository metadata70
- Repository has a description.
- Primary language detected: Python.
- Activity100
- Actively maintained (pushed within the last month).
- 11,556 stars.
- Housekeeping100
- .gitignore present.
Repository files34 root entries
- .githubGood: CI is configured (.github/workflows/ci.yml).Good: Issue or PR templates present.
- assetsGood: Environment pinned via assets/docker/clickhouse/docker-compose.yml.
- ci
- contribGood: Test files detected (contrib/capitalone_dataprofiler_expectations/capitalone_dataprofiler_expectations/tests).Good: Linter or formatter configured (contrib/ruff.toml).
- docs
- great_expectations
- reqs
- scripts
- tests
- .coderabbit.yaml
- .git-blame-ignore-revs
- .gitattributes
- .gitignoreGood: .gitignore present.
- .pre-commit-config.yaml
- .readthedocs.yml
- CITATION.cff
- codecov.yml
- CODEOWNERS
- constraints-dev.txt
- CONTRIBUTING_CODE.mdGood: CONTRIBUTING guide or docs directory present.
- CONTRIBUTING_WORKFLOWS.md
- IDE_SETUP_TIPS.md
- LICENSEGood: Licensed under Apache-2.0.
- lychee.toml
- MANIFEST.in
- pyproject.tomlGood: Dependency manifest found (pyproject.toml).
- README.mdGood: README is present.Good: README is well structured with multiple sections.Good: README includes screenshots or visuals. Great for first impressions.Good: README has code examples.Good: README links to a live demo or deployed app.Good: README includes status badges.Good: README documents how to install the project.Good: README documents how to run the project.
- requirements-dev.txt
- requirements-types.txt
- requirements.txtGood: Lockfile present (requirements.txt). Installs are reproducible.
- setup.cfg
- setup.py
- tasks.py
- versioneer.py