Source code ↗

Flaky Test Analyzer

Analyze flaky test behavior from GitHub Actions artifacts and prioritize tests by wasted CI time and instability signals.

Developed by Vineet Kumar

Detected Flaky Tests

Total Wasted CI Time (s)

0.0

Highest Flake Score

0.00

Demo disclaimer

This web UI currently supports public GitHub repository URLs for quick evaluation and demo flows.

For private repositories, use the workflow at ci-snippet/flaky-tests.yml (or copy equivalent steps) in your repo CI pipeline, then run this analyzer against generated reports in your controlled environment.

Why this matters: reliable flake detection needs test reruns/repeats; single run per commit usually cannot expose flaky behavior.

Before you fetch: the target repository must upload JUnit/Surefire XML test artifacts in GitHub Actions (for example via an upload-artifact step). If no matching test artifact exists, the dashboard cannot analyze flakiness.

AI Explain setup (model-agnostic)

Flaky analysis works without any LLM. Explain supports LLM_PROVIDER=anthropic|openai|ollama.

Examples: LLM_PROVIDER=anthropic + ANTHROPIC_API_KEY, or LLM_PROVIDER=openai + OPENAI_API_KEY, or LLM_PROVIDER=ollama.

Optional model override: LLM_MODEL (for example gpt-4o-mini or llama3.2).

No flaky tests yet. Fetch a repo or load demo data above (real detection needs more than one run per commit, e.g. reruns).