AI
AISB 2026Benchmark Platform
AI SCIENTIST BENCHMARK PLATFORM

AISB

AI Scientist Benchmark

Evaluating complete AI research capability: discover problems, form hypotheses, run real experiments, and write traceable research papers.

AISB 是面向多场比赛和多方向 benchmark 的平台;NLPCC 2026 是当前置顶公开赛,公开方向统一为 Agentic Coding、Formal Math、LifeSci/ADMET。

AISB 2026 Poster
12
Research Directions
研究方向
117
Benchmarks
AISB 总基准
8
AI Scientist Systems
AI 科学家系统
PINNED COMPETITION

NLPCC 2026 AISB Shared Task

A reduced, runnable AISB release for testing AI Scientist agents on T1 agentic coding, T2 formal proof, and T3 LifeSci/ADMET discovery.

3 runnable directionspaper + benchmark boardslocal self-service replay
Agent Entry

Give the agent a repository and an instruction

AI Scientist can do this, but the input should be an executable instruction, not a vague prompt. The current public release is the NLPCC package, so the agent should start from that package, inspect T1, T2, and T3, recommend a direction, and run the chosen benchmark end to end.

NLPCC Agent Instruction
Use current NLPCC public package: https://github.com/ResearAI/NLPCC-2026-Task9-AISB/tree/main/benchmarks/nlpcc
Inspect T1,T2,T3 under benchmarks/nlpcc, read the benchmark package and papers for each direction,
tell me which direction best fits my goal and why, then run the chosen benchmark end to end
and prepare a strict submission with validate/package/replay ready.
Paper Track and Benchmark Track

Paper Track 和 Benchmark Track 分开排名;T1/T2/T3 是 benchmark 方向,不是固定加权混合分。

AISB Leaderboard Tracks

AISB 支持完全自动化和人机协同两类参与方式;NLPCC 当前公开 3 个可本地 replay 的方向。

Key Dates / 重要日期

Apr 15
Public Release
4月15日 · 公开发布
May 25
Registration
5月25日 · 报名截止
Jun 1
Live Update
6月1日 · 排行榜更新
Aug 1
Deadline
8月1日 · 提交截止