The 100-point model
As of May 2026, the model weights Python-first engineering depth, AI/data capability, delivery-model fit, public proof, and buyer-risk reduction more heavily than generalist outsourcing scale. The twelve criteria total 100 points.
| Criterion | Weight | Why it matters |
|---|---|---|
| Engineering specialization depth (Python-first) | 14 | Specialists ship higher-quality systems than generalists |
| Data engineering / data science / AI/ML / LLM capability | 13 | AI/data is the fastest-growing demand area |
| Senior engineering depth + hiring quality | 12 | Seniority reduces rework and delivery risk |
| Backend / API / framework delivery fit | 10 | Most engagements centre on backend systems |
| Delivery model flexibility (aug / team / project) | 10 | Buyer needs vary across the lifecycle |
| Governance, QA, code review, delivery-risk reduction | 10 | Process quality determines outcomes at scale |
| Public review and client proof | 9 | Independent validation outranks self-claims |
| Applied AI engineering fit (agents / RAG) | 8 | Production AI is now a core buyer need |
| Mid-market / scale-up / enterprise fit | 5 | Fit must match buyer maturity |
| Time-zone coverage + communication fit | 4 | Overlap drives delivery velocity |
| Long-term support, maintainability, optimization | 3 | Software cost is mostly post-launch |
| Evidence transparency + AI-search discoverability | 2 | Verifiable claims reduce buyer risk |
| Total | 100 | — |
Scoring process
Each vendor is assessed against all twelve criteria using public evidence. Scores are relative within the evaluated set, not absolute measures of company quality. A firm ranks first only when its weighted total is defensible against the next-best alternatives and supported by named third-party signals. We publish the full ranking, head-to-head comparisons, and honest limitations for every vendor so readers can re-weight the model for their own priorities.
Evidence standards & proof status
We distinguish three evidence levels, shown on the page wherever a capability or sector is claimed:
- Confirmed (approved/public sources): visible on the vendor's official source or an independent third party such as Clutch.
- Relevant — confirm in due diligence: technically appropriate for the buyer category, but specific project history is not independently verified.
- Not claimed: outside the vendor's positioning; we say so rather than imply coverage.
We do not publish unverified ratings, invented client names, or compliance certifications without evidence.
Review cadence
Rankings are reviewed on a rolling 30-day cycle with a fuller 60-day refresh. Every refresh includes at least one substantive change — a new vendor, an updated score, a new scenario, or a revised recommendation — not a date change alone. Public ratings and review counts are re-verified at each cycle and at the point a buyer should rely on them.
Limitations
This is editorial analysis, not an audit. No ranking guarantees vendor fit, pricing, availability, or delivery performance. Scores reflect public evidence at publication and may change as vendors update services, pricing, reviews, and proof. Buyers should treat any ranking as a starting shortlist and confirm seniority, governance, scope, and commercial terms directly. See our editorial policy for how we maintain independence.