Live Platform Data

Drux Activity

Real-world AI model performance from every query run on Drux. Speed, reliability, and consensus measured on actual user questions — not controlled benchmarks.

647 searches · 1,992 model calls · updated every 5 min

Platform Pulse

Total Searches

647

This Week

last 7 days

Today

Model Calls

1,992

across all searches

Avg Consensus Score

61%

across all completed searches

Model Completion Rate

73%

calls that returned successfully

Model Leaderboard — on Drux

Last 647 searches · sorted by Avg Speed (fastest first)

Rank	Model	Tier	Reliability	Avg Speed	Queries	Consensus Wins	Timeouts
1	Mercury 2	FREE	98%	3.9s	110	77	—
2	Gemini 2.5 Flash	FREE	40%	7.0s	5	2	—
3	Mistral Small 4	FREE	100%	8.6s	136	90	—
4	Command R	FREE	89%	8.7s	81	51	9
5	Grok 4.3	PAID	100%	8.7s	11	11	—
6	Phi-4	FREE	98%	8.7s	153	114	2
7	Hermes 3 70B	FREE	97%	12.4s	98	72	3
8	GPT OSS 120B	FREE	75%	12.9s	114	65	28
9	Llama 4 Maverick	FREE	90%	13.2s	193	129	19
10	Claude Sonnet 4.6	PAID	88%	14.1s	26	23	2
11	Qwen3 235B	FREE	74%	14.2s	91	47	23
12	Gemini 3.5 Flash	PAID	100%	14.3s	27	27	—
13	DeepSeek V3	FREE	90%	17.0s	131	96	11
14	Sonar Pro	PAID	100%	17.8s	3	3	—
15	Gemma 3 27B	FREE	53%	19.3s	161	69	74
16	Solar Pro 3	FREE	63%	20.3s	70	37	22
17	Seed 1.6	FREE	28%	23.7s	85	20	60
18	GPT-5.5	PAID	80%	24.6s	30	21	2
19	Nemotron 49B	FREE	53%	24.8s	112	54	53
20	GPT-5 Mini	FREE	100%	25.3s	11	11	—
21	ERNIE 4.5 300B	FREE	4%	27.8s	84	3	6
22	Claude Opus 4.7	PAID	100%	31.1s	3	3	—
23	OLMo 3 32B	FREE	0%	—	100	0	—

⚡ Fastest on Drux

Mercury 2

avg 3.9s · 98% reliable

✓ Most Reliable on Drux

Mistral Small 4

100% completion · 136 calls

Consensus Digest — Recent Public Searches

✓ Models Agreed

BeEzrat HaShem Inc. Earns Candid Platinum Seal of Transparency for the 3rd Time

100%

What if users start cloning SaaS using AI

90%

Why about a third of the submissions become dead in mere minutes?

90%

Is HN crowd a left-leaning?

90%

How Do You Connect OpenAI Secure MCP Tunnel with Claude Desktop

90%

⚡ Models Diverged

How many failed startups have you launched?

40%

What are you building first with Fable?

40%

Got access to Gemini's actual thinking

40%

The biggest issue is the final paragraph: “The portals are open. The money is there. No connections needed.” This cannot be verified as a blanket statement.
* Some programmes have application windows, not permanently open portals.
* Some are cohort-based or depend on funding availability.
* Meeting eligibility requirements does not guarantee acceptance or funding.
* Demand often exceeds available spaces.
Overall assessment
* About 80–90% accurate.
* Real government programmes: ✔️
* Some benefits simplified or exaggerated: ✔️
* “All portals are open” and “money is there” are not verifiable universal claims: ✔️
If you were sharing this publicly, a more accurate ending would be: “These are legitimate government programmes. Availability, eligibility, and application windows vary, so check each official portal for current application status and requirements before applying.” That version is much easier to verify and avoids implying that every programme is currently accepting applications or has guaranteed funding.

50%

Who has been caught out by this anti-pattern from Google?

50%

Updated every 5 min · Data from last 1,000 searches

OpenRouter Usage

Raw model traffic from Drux's OpenRouter account. Tokens in = prompts we send; tokens out = model responses.

Lifetime key spend: $93.87 · data refreshes hourly from OpenRouter

About This Data

How is this different from other AI benchmarks?

Most AI benchmarks are run under controlled laboratory conditions on standardised test sets. Drux Activity measures performance on real user questions — diverse, unpredictable, and representative of actual use. Speed and reliability numbers here reflect what users actually experience.

What is a Consensus Win?

When multiple models answer the same question and their responses agree closely (consensus score ≥ 7/10), every model that responded is credited with a Consensus Win. A high Consensus Win rate means a model consistently lands on the same answer as its peers, which is a signal of reliability that goes beyond accuracy alone.
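The crediting rule described above can be sketched in a few lines. This is only an illustration under stated assumptions: the record layout (`consensus_score`, `responded`) and the function name `credit_consensus_wins` are hypothetical and not taken from Drux's code; only the 7/10 threshold comes from the definition above.

```python
from collections import Counter

# Threshold from the definition above: a consensus score of at least 7 out of 10.
CONSENSUS_THRESHOLD = 7


def credit_consensus_wins(searches):
    """Count Consensus Wins per model.

    `searches` is a hypothetical list of dicts with two keys:
      'consensus_score': score from 0 to 10 for the search
      'responded': names of the models that returned an answer
    """
    wins = Counter()
    for search in searches:
        if search["consensus_score"] >= CONSENSUS_THRESHOLD:
            # Every model that responded to a high-consensus search gets one win.
            wins.update(search["responded"])
    return wins


# Made-up example data, for illustration only.
searches = [
    {"consensus_score": 9, "responded": ["Mercury 2", "Phi-4", "DeepSeek V3"]},
    {"consensus_score": 4, "responded": ["Mercury 2", "Phi-4"]},
    {"consensus_score": 7, "responded": ["Phi-4", "DeepSeek V3"]},
]
wins = credit_consensus_wins(searches)
```

With this sample data, only the first and third searches meet the threshold, so the second search credits no one.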

Why do free models have more queries?

Drux randomly selects models for each search. Free-tier models are included in more searches because they are available to all users. Paid models are selected when users opt into premium tiers. Query count reflects availability and tier distribution, not quality — sort by Speed or Reliability for a fairer comparison.
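A rough sketch of how tier-based random selection could produce this skew is shown below. Everything here is an assumption made for illustration: the pool membership, the number of models per search (`k`), the `select_models` helper, and the idea that premium searches draw from free and paid models together are not documented behaviour of Drux.

```python
import random

# Hypothetical pools. The names appear in the leaderboard, but the grouping
# and pool sizes are assumptions for this example.
FREE_MODELS = ["Mercury 2", "Mistral Small 4", "Phi-4", "Llama 4 Maverick"]
PAID_MODELS = ["Grok 4.3", "Claude Sonnet 4.6", "GPT-5.5"]


def select_models(premium, k=3, rng=None):
    """Pick k distinct models at random from the pool the user's tier allows."""
    rng = rng or random.Random()
    # Assumption: premium users can draw from both pools, free users only from free models.
    pool = FREE_MODELS + PAID_MODELS if premium else FREE_MODELS
    return rng.sample(pool, k)


# Fixed seeds make the example reproducible.
free_pick = select_models(premium=False, rng=random.Random(0))
premium_pick = select_models(premium=True, rng=random.Random(0))
```

Because free models are eligible in every search and paid models only in premium ones, free models accumulate more queries over time regardless of quality.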

How often is this updated?

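The leaderboard updates every 5 minutes, pulling from the last 1,000 completed searches. As query volume grows, each model's figures rest on more calls and become more statistically reliable. Until then, rows with only a handful of queries (for example, Sonar Pro and Claude Opus 4.7 with 3 each) should be read with caution.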
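One way to see how much sample size matters is to put a confidence interval around a completion rate. The sketch below uses the Wilson score interval, which is a standard statistical technique chosen here for illustration and not something the page says Drux computes. The call counts come from the leaderboard rows for Sonar Pro (3 queries, 100%) and Mistral Small 4 (136 queries, 100%).

```python
import math


def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a success rate (z=1.96 gives roughly 95% coverage)."""
    if total == 0:
        return (0.0, 1.0)  # no data: the rate could be anything
    p = successes / total
    z2 = z * z
    denom = 1 + z2 / total
    center = (p + z2 / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z2 / (4 * total * total)) / denom
    # Clamp to [0, 1] to guard against floating-point overshoot.
    return (max(0.0, center - half), min(1.0, center + half))


# Both models show 100% reliability, but on very different call counts.
sonar_low, sonar_high = wilson_interval(3, 3)
mistral_low, mistral_high = wilson_interval(136, 136)
```

Under these assumptions the lower bound for 3 of 3 successful calls is roughly 0.44, while for 136 of 136 it is roughly 0.97, so two rows that both read 100% carry very different amounts of evidence.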