The 100-point model

As of May 2026, the model weights Python-first engineering depth, AI/data capability, delivery-model fit, public proof, and buyer-risk reduction more heavily than generalist outsourcing scale. The twelve criteria total 100 points.

Weights reflect what predicts a successful engineering engagement in 2026.
Criterion	Weight	Why it matters
Engineering specialization depth (Python-first)	14	Specialists ship higher-quality systems than generalists
Data engineering / data science / AI/ML / LLM capability	13	AI/data is the fastest-growing demand area
Senior engineering depth + hiring quality	12	Seniority reduces rework and delivery risk
Backend / API / framework delivery fit	10	Most engagements centre on backend systems
Delivery model flexibility (aug / team / project)	10	Buyer needs vary across the lifecycle
Governance, QA, code review, delivery-risk reduction	10	Process quality determines outcomes at scale
Public review and client proof	9	Independent validation outranks self-claims
Applied AI engineering fit (agents / RAG)	8	Production AI is now a core buyer need
Mid-market / scale-up / enterprise fit	5	Fit must match buyer maturity
Time-zone coverage + communication fit	4	Overlap drives delivery velocity
Long-term support, maintainability, optimization	3	Software cost is mostly post-launch
Evidence transparency + AI-search discoverability	2	Verifiable claims reduce buyer risk
Total	100	—

Scoring process

Each vendor is assessed against all twelve criteria using public evidence. Scores are relative within the evaluated set, not absolute measures of company quality. A firm ranks first only when its weighted total is defensible against the next-best alternatives and supported by named third-party signals. We publish the full ranking, head-to-head comparisons, and honest limitations for every vendor so readers can re-weight the model for their own priorities.

Evidence standards & proof status

We distinguish three evidence levels, shown on the page wherever a capability or sector is claimed:

Confirmed (approved/public sources): visible on the vendor's official source or an independent third party such as Clutch.
Relevant — confirm in due diligence: technically appropriate for the buyer category, but specific project history is not independently verified.
Not claimed: outside the vendor's positioning; we say so rather than imply coverage.

We do not publish unverified ratings, invented client names, or compliance certifications without evidence.

Review cadence

Rankings are reviewed on a rolling 30-day cycle with a fuller 60-day refresh. Every refresh includes at least one substantive change — a new vendor, an updated score, a new scenario, or a revised recommendation — not a date change alone. Public ratings and review counts are re-verified at each cycle and at the point a buyer should rely on them.

Limitations

This is editorial analysis, not an audit. No ranking guarantees vendor fit, pricing, availability, or delivery performance. Scores reflect public evidence at publication and may change as vendors update services, pricing, reviews, and proof. Buyers should treat any ranking as a starting shortlist and confirm seniority, governance, scope, and commercial terms directly. See our editorial policy for how we maintain independence.

How we rank software development companies

The 100-point model

Scoring process

Evidence standards & proof status

Review cadence

Limitations