83/ 100 · B
Industry-defining adoption. A few engineering gaps, but the community carries it.
Robust Speech Recognition via Large-Scale Weak Supervision
Python102,714 starsMITupdated 2mo ago
DocumentationREADME, setup, examples, license
EngineeringTests, CI, linting, lockfiles
Project healthDescription, activity, stars, deps
What to fix first
The highest-impact improvements for this repo.
- 1CI/CDEngineeringInfo
Add `tsc --noEmit`, `mypy`, or `cargo check` to catch type errors before they merge.
- 2CI/CDEngineeringInfo
Upload coverage to Codecov, Coveralls, or report it with `--coverage` flags.
- 3READMEDocumentationInfo
Add CI/build status badges from shields.io or your CI provider to signal project health.
Detailed breakdown
Documentation
89- README90
- README is present.
- README is well structured with multiple sections.
- README includes screenshots or visuals. Great for first impressions.
- README has code examples.
- README links to a live demo or deployed app.
- No status badges in the README (−10 pts).Add CI/build status badges from shields.io or your CI provider to signal project health.
- Install and run instructions90
- README documents how to install the project.
- README documents how to run the project.
- If your project uses environment variables, add a .env.example listing them (+10 pts).Add a .env.example listing all required environment variables so contributors know what to set up.
- License100
- Licensed under MIT.
- Contributing guide70
- CONTRIBUTING guide or docs directory present.
- Optional: add a Code of Conduct (+5 pts).A CODE_OF_CONDUCT.md signals that your project is welcoming. GitHub has a template you can add in one click.
Engineering
76- Tests85
- Test files detected (tests).
- Pytest referenced in pyproject.toml and test files present.
- CI/CD100
Not applicable?
- CI is configured (.github/workflows/test.yml).
- CI workflow runs tests.
- CI runs on pull requests, not just on pushes to main.
- CI workflow runs a lint or format check.
- Optional: add type checking to CI.Add `tsc --noEmit`, `mypy`, or `cargo check` to catch type errors before they merge.
- Optional: report test coverage in CI.Upload coverage to Codecov, Coveralls, or report it with `--coverage` flags.
- CI caches dependencies for faster runs.
- CI tests across multiple environments or versions.
- Linting and formatting60
- Linter or formatter configured (.flake8).
- Reproducibility82
- Lockfile present (requirements.txt). Installs are reproducible.
- No Dockerfile or runtime version pin found. Adding one earns +10 pts.Add a Dockerfile, .nvmrc, or .python-version to pin the runtime version and make the environment reproducible.
- Dependabot configured for github-actions.
- Dependabot only covers one ecosystem (+12 pts; covering 2+ earns +20 pts).Add additional package-ecosystem entries (especially github-actions) to keep all dependencies current.
- Issue and PR templates0
- No issue or PR templates found (−90 pts).Add .github/ISSUE_TEMPLATE/ with bug_report.md and feature_request.md to guide contributors. It dramatically improves issue quality.
- Optional: add a SECURITY.md.A SECURITY.md explains how to responsibly disclose vulnerabilities. Worth adding once the project has real users.
Project health
91- Dependency manifest100
- Dependency manifest found (pyproject.toml).
- pyproject.toml has a [project] table with package metadata.
- pyproject.toml includes a description.
- pyproject.toml specifies requires-python, preventing installs on incompatible versions.
- pyproject.toml has a [build-system] table. The package can be built and published.
- Repository metadata85
- Repository has a description.
- Primary language detected: Python.
- Activity80
- Actively maintained (pushed within 3 months).
- 102,714 stars.
- Housekeeping100
- .gitignore present.
Repository files18 root entries
- .githubGood: CI is configured (.github/workflows/test.yml).Good: Dependabot configured for github-actions.
- data
- notebooks
- testsGood: Test files detected (tests).
- whisper
- .flake8Good: Linter or formatter configured (.flake8).
- .gitattributes
- .gitignoreGood: .gitignore present.
- .pre-commit-config.yaml
- approach.png
- CHANGELOG.mdGood: CONTRIBUTING guide or docs directory present.
- language-breakdown.svg
- LICENSEGood: Licensed under MIT.
- MANIFEST.in
- model-card.md
- pyproject.tomlGood: Dependency manifest found (pyproject.toml).
- README.mdGood: README is present.Good: README is well structured with multiple sections.Good: README includes screenshots or visuals. Great for first impressions.Good: README has code examples.Good: README links to a live demo or deployed app.Info: No status badges in the README (−10 pts).Fix: Add CI/build status badges from shields.io or your CI provider to signal project health.Good: README documents how to install the project.Good: README documents how to run the project.
- requirements.txtGood: Lockfile present (requirements.txt). Installs are reproducible.