TrendHub Logo
TrendHub

Models Intelligence

Hugging Face and model radar

This workspace now blends Hugging Face launches with frontier benchmark context, open-weight watchlists and official model specs. GitHub repository scanning stays on the main Trend Radar page.

Performance Atlas

Frontier rankings, open-weight leaders and official model specs

This layer combines live Arena rankings, Artificial Analysis benchmark snapshots and vendor documentation so the model board is not limited to Hugging Face launch activity alone.

Verified Apr 2, 2026 3-source stack

Frontier Models

Closed-model performance watch

Arena + Artificial Analysis + vendor docs

Google

Gemini 3.1 Pro Preview

Frontier

Best current Artificial Analysis intelligence score and a top-three Arena text position, making it a strong general frontier benchmark.

AA intelligence

57

Arena overall

#3

Coding

#3

Creative

#2

Longer query

#3

OpenAI

GPT-5.4 (xhigh)

Frontier

OpenAI’s flagship reasoning tier remains tied at the top of Artificial Analysis and still lands in Arena’s top group for text, coding and document work.

AA intelligence

57

Arena overall

#6

Expert

#3

Math

#1

Coding

#4

Method stack

How this layer stays useful

Rule 1

Arena tracks live human preference across text, code, document and search tasks.

Rule 2

Artificial Analysis adds benchmarked intelligence, price, speed and open-weight rankings.

Rule 3

Official vendor docs remain the source of truth for API availability, context windows and pricing.

Open Models

Open-weight leaders worth tracking

AA open-weight board + HF open LLM board

Z.ai

GLM-5 (Reasoning)

Current open-weight benchmark leader on Artificial Analysis, with a 200k context window and MIT licensing called out in the comparison pages.

AA open rank#1
AA intelligence50
Context200k
LicenseMIT

Moonshot

Kimi K2.5 (Reasoning)

Still sits in the top open-weight tier for reasoning-heavy work and remains one of the clearest self-hostable alternatives to frontier closed models.

AA open rank#2
AA intelligence47
ProfileReasoning
TrackOpen weights

Qwen

Qwen3.5 397B A17B

A top-three open-weight contender on Artificial Analysis that keeps showing up as a practical agentic and coding checkpoint to watch.

AA open rank#3
AA intelligence45
ProfileAgentic
TrackOpen weights

Official Specs

Source-of-truth docs

OpenAI

GPT-5.4 / 5.4-mini / 5.4-nano

Docs

Flagship reasoning and coding stack with clearly documented pricing tiers.

  • OpenAI recommends GPT-5.4 as the starting point for complex reasoning and coding work.
  • Official model pages list a 1M context window for GPT-5.4 and 400k for GPT-5.4 mini and nano, with 128k max output across the family.
  • Published pricing is $2.50 / $15 for GPT-5.4, $0.75 / $4.50 for mini and $0.20 / $1.25 for nano per 1M tokens.

Anthropic

Claude Opus 4.6 / Sonnet 4.6 / Haiku 4.5

Docs

Clear split between maximum intelligence, balanced production use and low-latency throughput.

  • Anthropic positions Claude Opus 4.6 as the most intelligent option for building agents and coding.
  • The models overview lists a 1M context window for Opus 4.6 and Sonnet 4.6, with 200k for Haiku 4.5.
  • Published pricing is $5 / $25 for Opus 4.6, $3 / $15 for Sonnet 4.6 and $1 / $5 for Haiku 4.5 per MTok.

Google

Gemini 2.5 Pro / 2.5 Flash / 2.5 Flash-Lite

Docs

Official Gemini lineup optimized around deep reasoning, price-performance and ultra-fast throughput.

  • Google positions Gemini 2.5 Pro as the most advanced option for complex tasks.
  • Gemini 2.5 Flash is framed as the best price-performance choice for low-latency reasoning.
  • Gemini 2.5 Flash-Lite is the fastest and most budget-friendly multimodal option in the family.

Latest Launch

gradients-io-tournaments/tournament-tourn_f4f8798047aeccfb_20260402-fce2627e-6339-452a-b797-327ec8dd75e6-5GU4Xkd3

General • Created Apr 3

9 items in the latest lane

Hot Momentum

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

image-text-to-text • Trend 760

9 items in the hot lane

Update Watch

gradients-io-tournaments/tournament-tourn_f4f8798047aeccfb_20260402-fce2627e-6339-452a-b797-327ec8dd75e6-5GU4Xkd3

General • Updated Apr 3

9 items in the updated lane

HF Frontier Feed

Fresh releases, crowd momentum and active model updates

Latest, hottest and recently updated Hugging Face models

Leaderboard Snapshot

Model Vault

1. GPT-5.4 (xhigh)

ELO 1852

OpenAItext

Context

256K

Speed

420

2. Claude Opus 4.6

ELO 1845

Anthropictext

Context

1M

Speed

350

3. GPT-5.3-Codex

ELO 1810

OpenAIcode

Context

128K

Speed

550

4. Gemini 3.1 Flash-Lite

ELO 1785

Googletext

Context

1M

Speed

1.2K

5. Gemini 3 Pro

ELO 1720

Googletext

Context

2M

Speed

480

Action playbook

Model evaluation steps

Validate Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled against one benchmark workflow before adopting it.

Review the card and tags for gradients-io-tournaments/tournament-tourn_f4f8798047aeccfb_20260402-fce2627e-6339-452a-b797-327ec8dd75e6-5GU4Xkd3 to see whether it is a genuine launch or a thin checkpoint wrapper.

Re-check gradients-io-tournaments/tournament-tourn_f4f8798047aeccfb_20260402-fce2627e-6339-452a-b797-327ec8dd75e6-5GU4Xkd3 when the lastModified lane moves, especially if it already fits your stack.

Use the hot, latest and updated lanes together before moving a model into production testing.

Task Mix

The most common task categories showing up across latest, hot and updated lanes.

General

HF lane density

16

image-text-to-text

HF lane density

5

text-generation

HF lane density

3

text-to-image

HF lane density

1

automatic-speech-recognition

HF lane density

1

text-to-speech

HF lane density

1