Models Intelligence

Hugging Face and model radar

This workspace now blends Hugging Face launches with frontier benchmark context, open-weight watchlists and official model specs. GitHub repository scanning stays on the main Trend Radar page.

Performance Atlas

Frontier rankings, open-weight leaders and official model specs

This layer combines the live OpenAI model catalog, benchmark watchlists and vendor documentation so the model board is not limited to Hugging Face launch activity alone.

Synced Jul 30, 2026 · 3-source stack

Frontier Models

Closed-model performance watch

Provider APIs + benchmark watchlists + vendor docs

OpenAI

GPT-5.6 Luna

Frontier

OpenAI lane is now sourced from the OpenAI Models API Auto-Sync (2026-07-30). The site promotes the newest available GPT family model first and keeps the previous static GPT-5.4 entry only as historical fallback context.

Catalog source

Live API

Internal rank

ELO 2002

Context

Model family

GPT

OpenAI models · OpenAI model API

Anthropic

Claude Opus 5

Frontier

Anthropic lane now tracks the OpenRouter Catalog Auto-Sync (2026-07-30). The newest Claude family flagship is promoted automatically as soon as it appears in the synced catalog.

Catalog source

Live API

Internal rank

ELO 1870

Context

Max output

64k+

Claude models overview · Anthropic Models API

Google

Gemini 3.1 Pro Preview

Frontier

Best current Artificial Analysis intelligence score and a top-three Arena text position, making it a strong general frontier benchmark.

AA intelligence

Arena overall

Coding

Creative

Longer query

Arena leaderboard · Artificial Analysis · Gemini model docs

Method stack

How this layer stays useful

Rule 1

OpenAI model availability is pulled from the live Models API when NUXT_OPENAI_API_KEY is present.

Rule 2

Anthropic model availability is pulled from the live Models API when NUXT_ANTHROPIC_API_KEY is present; otherwise the keyless OpenRouter public catalog keeps the lane current.

Rule 3

Provider lanes without a vendor API key fall back to the OpenRouter public model catalog (openrouter.ai/api/v1/models) before any hardcoded seed is used.

Rule 4

Arena and Artificial Analysis remain benchmark references for relative quality, while provider APIs decide whether a model is currently available.

Rule 5

Official vendor docs remain the source of truth for API availability, context windows and pricing.
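The fallback order in Rules 1 to 3 can be sketched as a small resolver. This is a minimal illustration, not the site's actual implementation: the function name `resolve_catalog_source`, the returned labels and the `openrouter_reachable` flag are hypothetical. Only the two environment variable names and the OpenRouter catalog URL come from the rules above.

```python
import os
from typing import Mapping, Optional

# Public catalog URL quoted in Rule 3.
OPENROUTER_CATALOG_URL = "https://openrouter.ai/api/v1/models"

# Env var names from Rules 1 and 2. Providers not listed here have no
# vendor key named in the text, so they always take the fallback path.
VENDOR_KEY_ENV = {
    "openai": "NUXT_OPENAI_API_KEY",
    "anthropic": "NUXT_ANTHROPIC_API_KEY",
}


def resolve_catalog_source(
    provider: str,
    env: Optional[Mapping[str, str]] = None,
    openrouter_reachable: bool = True,  # hypothetical flag for illustration
) -> str:
    """Return which source a provider lane would use under Rules 1 to 3."""
    env = os.environ if env is None else env
    key_name = VENDOR_KEY_ENV.get(provider)
    # Rules 1 and 2: a non-empty vendor key means the live Models API is used.
    if key_name and env.get(key_name):
        return "vendor_api"
    # Rule 3: without a vendor key, try the keyless OpenRouter catalog first.
    if openrouter_reachable:
        return "openrouter_catalog"
    # Last resort: the hardcoded seed entry.
    return "hardcoded_seed"
```

For example, `resolve_catalog_source("anthropic", {})` yields `"openrouter_catalog"`, which matches the keyless path described in Rule 2. Benchmark references (Rule 4) and vendor docs (Rule 5) are deliberately left out of the sketch because they inform ranking and specs, not availability.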

Open Models

Open-weight leaders worth tracking

AA open-weight board + HF open LLM board

Z.ai

GLM-5 (Reasoning)

Current open-weight benchmark leader on Artificial Analysis, with a 200k context window and MIT licensing called out in the comparison pages.

AA open rank: #1

AA intelligence: 50

Context: 200k

License: MIT

AA open weights · HF Open LLM board

Moonshot

Kimi K2.5 (Reasoning)

Still sits in the top open-weight tier for reasoning-heavy work and remains one of the clearest self-hostable alternatives to frontier closed models.

AA open rank: #2

AA intelligence: 47

Profile: Reasoning

Track: Open weights

AA open weights · HF Open LLM board

Qwen

Qwen3.5 397B A17B

A top-three open-weight contender on Artificial Analysis that keeps showing up as a practical agentic and coding checkpoint to watch.

AA open rank: #3

AA intelligence: 45

Profile: Agentic

Track: Open weights

AA open weights · HF Open LLM board

Official Specs

Source-of-truth docs

OpenAI

GPT-5.6 Luna / GPT-5.6 Sol / GPT-5.6 Terra

Docs

Live OpenAI catalog lane for the newest GPT family available to the configured API key.

The OpenAI Models API returned GPT-5.6 Luna / GPT-5.6 Sol / GPT-5.6 Terra for the current key.
Official model docs remain the source of truth for context windows, pricing and intended use.
Keep benchmark placement separate from API availability: a model can appear in the catalog before third-party leaderboards fully settle.

OpenAI model docs · OpenAI model API

Anthropic

Claude Opus 5 / Claude Sonnet 5

Docs

Official Claude lineup centered on Opus 4.7 for maximum agentic coding, Sonnet 4.6 for balanced production use and Haiku 4.5 for low-latency throughput.

The Anthropic Models API returned Claude Opus 5 / Claude Sonnet 5 for the current key.
Anthropic lists Claude Opus 4.7 as the most capable generally available Claude model and exposes it as claude-opus-4-7.
Published specs list a 1M context window and 128k max output for Claude Opus 4.7, with the same $5 / $25 per MTok headline pricing as Opus 4.6.

Claude models overview · Anthropic Models API

Google

Gemini 2.5 Pro / 2.5 Flash / 2.5 Flash-Lite

Docs

Official Gemini lineup optimized around deep reasoning, price-performance and ultra-fast throughput.

Google positions Gemini 2.5 Pro as the most advanced option for complex tasks.
Gemini 2.5 Flash is framed as the best price-performance choice for low-latency reasoning.
Gemini 2.5 Flash-Lite is the fastest and most budget-friendly multimodal option in the family.

Gemini model docs

Latest Launch

KickItLikeShika/qwen3-1.7b-dft-science

General • Created Jul 30

9 items in the latest lane

Hot Momentum

moonshotai/Kimi-K3

image-text-to-text • Trend 8.1K

9 items in the hot lane

Update Watch

0001AMA/ama-akuoko

General • Updated Jul 30

9 items in the updated lane

HF Frontier Feed

Fresh releases, crowd momentum and active model updates

Latest, hottest and recently updated Hugging Face models

Latest

createdAt

KickItLikeShika/qwen3-1.7b-dft-science

General

Created

Jul 30

DL 0 · Likes 0 · Launched Jul 30

Rupeai/Stopmotion

General

Created

Jul 30

DL 0 · Likes 0 · Launched Jul 30

mervegundogdu/gemma-2b-fraud-detection

General | transformers

Created

Jul 30

DL 0 · Likes 0 · Launched Jul 30

KickItLikeShika/qwen3-1.7b-dft-tooluse

General

Created

Jul 30

DL 0 · Likes 0 · Launched Jul 30

Hot

trendingScore

moonshotai/Kimi-K3

image-text-to-text | transformers

Trend

8.1K

DL 387.8K · Likes 8.9K · Seen Jul 27

baidu/Unlimited-OCR

image-text-to-text | transformers

Trend

597

DL 2.6M · Likes 3.6K · Seen Jul 29

DavidAU/Qwen3.6-27B-Fable-Fusion-711-Uncensored-Heretic-NM-DAU-NEO-MAX-MTP-GGUF

image-text-to-text

Trend

575

DL 955.8K · Likes 1K · Seen Jul 29

poolside/Laguna-S-2.1

text-generation | transformers

Trend

320

DL 73.2K · Likes 839 · Seen Jul 27

Updated

lastModified

0001AMA/ama-akuoko

General

Updated

Jul 30

DL 0 · Likes 1 · Updated Jul 30

Rupeai/Stopmotion

General

Updated

Jul 30

DL 0 · Likes 0 · Updated Jul 30

KickItLikeShika/qwen3-1.7b-dft-science

General

Updated

Jul 30

DL 0 · Likes 0 · Updated Jul 30

0001AMA/deborah-akuoko

General

Updated

Jul 30

DL 0 · Likes 1 · Updated Jul 30
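The three lanes above are labelled with the sort keys createdAt, trendingScore and lastModified. A rough sketch of how such lanes might be requested is shown here. It assumes the public Hugging Face Hub models endpoint and its `sort`, `direction` and `limit` query parameters; the page itself does not say how the feed is fetched, so the endpoint, the helper name `lane_url` and the lane names are illustrative assumptions. The default limit of 9 mirrors the "9 items" lane counts reported on this page.

```python
from urllib.parse import urlencode

# Assumed endpoint: the public Hugging Face Hub model listing API.
HF_MODELS_ENDPOINT = "https://huggingface.co/api/models"

# Sort keys exactly as they appear under each lane heading on the page.
LANE_SORT_KEYS = {
    "latest": "createdAt",
    "hot": "trendingScore",
    "updated": "lastModified",
}


def lane_url(lane: str, limit: int = 9) -> str:
    """Build a listing URL for one lane, newest or highest value first."""
    params = {
        "sort": LANE_SORT_KEYS[lane],
        "direction": -1,  # descending order (assumed parameter)
        "limit": limit,
    }
    return f"{HF_MODELS_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    for lane in LANE_SORT_KEYS:
        print(lane, lane_url(lane))
```

The helper only builds URLs, so it can be checked without network access; fetching and parsing the JSON response is left to whichever HTTP client the project already uses.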

Leaderboard Snapshot

Model Vault

Estimated: these are in-house estimates based on official catalog metadata, not measured benchmarks. Artificial Analysis · LMArena

1. GPT-5.6 Luna

ELO 2002

OpenAI · text

Context

Speed (est.)

430

2. GPT-5.6 Sol

ELO 2002

OpenAI · text

Context

Speed (est.)

430

3. GPT-5.6 Terra

ELO 2002

OpenAI · text

Context

Speed (est.)

430

4. GPT-5.3 Codex

ELO 1961

OpenAI · code

Context

400K

Speed (est.)

560

5. Claude Opus 5

ELO 1870

Anthropic · text

Context

Speed (est.)

350

6. Claude Sonnet 5

ELO 1855

Anthropic · text

Context

Speed (est.)

520

7. Gemini 3.6 Flash

ELO 1726

Google · text

Context

Speed (est.)

560

8. Gemini 3.5 Flash

ELO 1714

Google · text

Context

Speed (est.)

1.2K

Action playbook

Model evaluation steps

Validate moonshotai/Kimi-K3 against one benchmark workflow before adopting it.

Review the card and tags for KickItLikeShika/qwen3-1.7b-dft-science to see whether it is a genuine launch or a thin checkpoint wrapper.

Re-check 0001AMA/ama-akuoko when the lastModified lane moves, especially if it already fits your stack.

Use the hot, latest and updated lanes together before moving a model into production testing.
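One way to use the lanes together is to look for models that appear in more than one of them. The sketch below is only an illustration of that idea: the helper names are hypothetical, and the sample data is limited to the four entries per lane that are visible on this page, not the full nine-item lanes.

```python
from typing import Dict, List, Set


def lane_membership(lanes: Dict[str, List[str]]) -> Dict[str, Set[str]]:
    """Map each model id to the set of lanes it appears in."""
    membership: Dict[str, Set[str]] = {}
    for lane, model_ids in lanes.items():
        for model_id in model_ids:
            membership.setdefault(model_id, set()).add(lane)
    return membership


def in_multiple_lanes(lanes: Dict[str, List[str]], min_lanes: int = 2) -> List[str]:
    """Return model ids seen in at least `min_lanes` lanes, sorted by id."""
    return sorted(
        model_id
        for model_id, seen in lane_membership(lanes).items()
        if len(seen) >= min_lanes
    )


# Sample data: the entries shown in each lane on this page.
LANES = {
    "latest": [
        "KickItLikeShika/qwen3-1.7b-dft-science",
        "Rupeai/Stopmotion",
        "mervegundogdu/gemma-2b-fraud-detection",
        "KickItLikeShika/qwen3-1.7b-dft-tooluse",
    ],
    "hot": [
        "moonshotai/Kimi-K3",
        "baidu/Unlimited-OCR",
        "DavidAU/Qwen3.6-27B-Fable-Fusion-711-Uncensored-Heretic-NM-DAU-NEO-MAX-MTP-GGUF",
        "poolside/Laguna-S-2.1",
    ],
    "updated": [
        "0001AMA/ama-akuoko",
        "Rupeai/Stopmotion",
        "KickItLikeShika/qwen3-1.7b-dft-science",
        "0001AMA/deborah-akuoko",
    ],
}

if __name__ == "__main__":
    print(in_multiple_lanes(LANES))
```

On this sample, the overlap is between the latest and updated lanes only, which is expected for models created and modified on the same day. A model that shows up in the hot lane as well would be a stronger signal for production testing.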

Task Mix

The most common task categories showing up across latest, hot and updated lanes.

General

HF lane density

text-generation

HF lane density

image-text-to-text

HF lane density

text-to-speech

HF lane density

object-detection

HF lane density
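The task mix can be approximated by counting task labels across the lanes. The following is a minimal sketch under stated assumptions: the function name `task_mix` is hypothetical, a missing task tag is treated as "General" (an assumption about how the page labels untagged models), and the sample input covers only the four entries per lane visible in the feed, so categories such as text-to-speech and object-detection do not appear in it.

```python
from collections import Counter
from typing import Dict, List, Optional, Tuple


def task_mix(lanes: Dict[str, List[Optional[str]]]) -> List[Tuple[str, int]]:
    """Count task labels across all lanes, most common first."""
    counts: Counter = Counter()
    for tasks in lanes.values():
        for task in tasks:
            # Assumption: models without a task tag are shown as "General".
            counts[task or "General"] += 1
    return counts.most_common()


# Task labels of the entries visible in each lane on this page.
LANE_TASKS = {
    "latest": ["General", "General", "General", "General"],
    "hot": [
        "image-text-to-text",
        "image-text-to-text",
        "image-text-to-text",
        "text-generation",
    ],
    "updated": ["General", "General", "General", "General"],
}

if __name__ == "__main__":
    for task, count in task_mix(LANE_TASKS):
        print(f"{task}: {count}")
```

Counting per lane entry rather than per unique model means a model listed in two lanes contributes twice, which fits a "lane density" reading of the metric; deduplicating by model id first would give a per-model mix instead.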

Release Console

Recent launches and recently touched models worth a second look.

KickItLikeShika/qwen3-1.7b-dft-science

General

DL 0

Jul 30

Rupeai/Stopmotion

General

DL 0

Jul 30

mervegundogdu/gemma-2b-fraud-detection

General

DL 0

Jul 30

KickItLikeShika/qwen3-1.7b-dft-tooluse

General

DL 0

Jul 30

joeygambino/ltx23-videoonly-distill-loras

General

DL 0

Jul 30

TheWirelessPhoenix/LFM2.5-1.2B-Instruct-oQ6e-fp16

General

DL 0

Jul 30