75/ 100 · C
Good community traction and solid fundamentals. Nearly in the top tier.
Apache DataFusion SQL Query Engine
Rust8,874 starsApache-2.0updated 2d ago
DocumentationREADME, setup, examples, license
EngineeringTests, CI, linting, lockfiles
Project healthDescription, activity, stars, deps
What to fix first
The highest-impact improvements for this repo.
- 1CI/CDEngineeringInfo
Add a lint step (e.g. `npm run lint`, `ruff check .`, `cargo clippy`) to catch style issues automatically.
- 2READMEDocumentationWarning
Add a GIF, screenshot, or logo image. It is the fastest way to show what your project does.
- 3READMEDocumentationInfo
Show a quick-start snippet so contributors can see what using your project looks like.
Detailed breakdown
Documentation
73- README55
- README is present.
- README is well structured with multiple sections.
- No screenshots or images in the README (−20 pts).Add a GIF, screenshot, or logo image. It is the fastest way to show what your project does.
- README has no code examples (−15 pts).Show a quick-start snippet so contributors can see what using your project looks like.
- README links to a live demo or deployed app.
- No status badges in the README (−10 pts).Add CI/build status badges from shields.io or your CI provider to signal project health.
- Install and run instructions90
- README documents how to install the project.
- README documents how to run the project.
- If your project uses environment variables, add a .env.example listing them (+10 pts).Add a .env.example listing all required environment variables so contributors know what to set up.
- License100
- Licensed under Apache-2.0.
- Contributing guide50
- CONTRIBUTING guide is very brief (0 pts for depth); 150+ words earns +6 pts, 400+ earns +12 pts.Add setup instructions, code style notes, and how to run tests.
- Contributing guide lacks a setup section (−12 pts).Show new contributors how to get a local dev environment running.
- Contributing guide lacks a code style section (−8 pts).Describe your linting/formatting rules and how to run them (e.g. npm run lint, ruff check .).
- Contributing guide lacks a testing section (−8 pts).Show contributors how to run the test suite (e.g. npm test, pytest, cargo test).
- Contributing guide lacks a PR workflow section (−8 pts).Explain how to fork, branch, and open a pull request so contributors know what to expect.
- Contributing guide has no code examples (−5 pts).Add code blocks showing example commands for setup, running tests, and submitting a PR.
- Code of conduct present.
Engineering
76- Tests60
- Test files detected (datafusion-cli/tests).
- CI/CD85
Not applicable?
- CI is configured (.github/workflows/dependencies.yml).
- CI workflow runs tests.
- CI runs on pull requests, not just on pushes to main.
- CI does not appear to run a linter (−15 pts).Add a lint step (e.g. `npm run lint`, `ruff check .`, `cargo clippy`) to catch style issues automatically.
- CI runs type checking (tsc, mypy, cargo check, etc.).
- CI reports or uploads test coverage.
- CI caches dependencies for faster runs.
- CI includes a build step.
- Linting and formatting60
- Linter or formatter configured (.editorconfig).
- No linter or type checker configured in pyproject.toml (−80 pts).Add [tool.ruff] for linting and [tool.mypy] for type checking to pyproject.toml.
- Reproducibility100
- Lockfile present (Cargo.lock). Installs are reproducible.
- Environment pinned via .devcontainer/Dockerfile.
- Dependabot covers 3 ecosystems (cargo, github-actions, pip). Dependencies stay current.
- Issue and PR templates90
- Issue or PR templates present.
- Optional: add a SECURITY.md.A SECURITY.md explains how to responsibly disclose vulnerabilities. Worth adding once the project has real users.
Project health
78- Dependency manifest55
- Dependency manifest found (Cargo.toml).
- Repository metadata70
- Repository has a description.
- Primary language detected: Rust.
- Activity100
- Actively maintained (pushed within the last month).
- 8,874 stars.
- Housekeeping100
- .gitignore present.
Repository files44 root entries
- .ai
- .devcontainerGood: Environment pinned via .devcontainer/Dockerfile.
- .githubGood: CI is configured (.github/workflows/dependencies.yml).Good: Dependabot covers 3 ecosystems (cargo, github-actions, pip). Dependencies stay current.Good: Issue or PR templates present.
- benchmarks
- ci
- datafusion
- datafusion-cliGood: Test files detected (datafusion-cli/tests).
- datafusion-examples
- dev
- docs
- python
- test-utils
- .asf.yaml
- .dockerignore
- .editorconfigGood: Linter or formatter configured (.editorconfig).
- .gitattributes
- .gitignoreGood: .gitignore present.
- .gitmodules
- AGENTS.md
- Cargo.lockGood: Lockfile present (Cargo.lock). Installs are reproducible.
- Cargo.tomlGood: Dependency manifest found (Cargo.toml).
- CHANGELOG.mdInfo: CONTRIBUTING guide is very brief (0 pts for depth); 150+ words earns +6 pts, 400+ earns +12 pts.Fix: Add setup instructions, code style notes, and how to run tests.Info: Contributing guide lacks a setup section (−12 pts).Fix: Show new contributors how to get a local dev environment running.Info: Contributing guide lacks a code style section (−8 pts).Fix: Describe your linting/formatting rules and how to run them (e.g. npm run lint, ruff check .).Info: Contributing guide lacks a testing section (−8 pts).Fix: Show contributors how to run the test suite (e.g. npm test, pytest, cargo test).Info: Contributing guide lacks a PR workflow section (−8 pts).Fix: Explain how to fork, branch, and open a pull request so contributors know what to expect.Info: Contributing guide has no code examples (−5 pts).Fix: Add code blocks showing example commands for setup, running tests, and submitting a PR.
- CLAUDE.md
- clippy.toml
- CODE_OF_CONDUCT.mdGood: Code of conduct present.
- CONTRIBUTING.md
- datafusion-testing
- doap.rdf
- header
- LICENSE.txtGood: Licensed under Apache-2.0.
- licenserc.toml
- lychee.toml
- NOTICE.txt
- parquet-testing
- pre-commit.sh
- pyproject.toml
- README.mdGood: README is present.Good: README is well structured with multiple sections.Warning: No screenshots or images in the README (−20 pts).Fix: Add a GIF, screenshot, or logo image. It is the fastest way to show what your project does.Info: README has no code examples (−15 pts).Fix: Show a quick-start snippet so contributors can see what using your project looks like.Good: README links to a live demo or deployed app.Info: No status badges in the README (−10 pts).Fix: Add CI/build status badges from shields.io or your CI provider to signal project health.Good: README documents how to install the project.Good: README documents how to run the project.
- rust-toolchain.toml
- rustfmt.toml
- snowflake_flatten_validation.sql
- taplo.toml
- testing
- typos.toml
- uv.lock