A public AI accountability initiative

Transparency
for the Republic.

The AI models entering our institutions are opaque. A republic is only as strong as what its citizens can see — so Republic Assay opens these models to the American people, and measures them against our values.

The observatory

Under assay.

The latest open-weight release from each major lab, listed with the nation that built it. Sixteen models have been assayed on the identical battery so far; nine cleared the instrument’s gates and carry scores. The measurements, the charts, and the provisional ranking live on the science page; models that fail a gate are shown failing, whoever built them.

Model | Lab | Jurisdiction | Training data | Assay
Qwen3.5 9B | Alibaba | China | Undisclosed | Scored
Yi-1.5 6B | 01.AI | China | Undisclosed | Scored
Qwen3 8B | Alibaba | China | Undisclosed | Scored
T-lite 2.1 8B | T-Tech | Russia | Undisclosed | Scored
LLM-jp-4 8B | LLM-jp | Japan | Undisclosed | Scored
OLMo 3 7B | Ai2 | United States | Fully public | Scored
Phi-4-mini | Microsoft | United States | Undisclosed | Scored
GPT-OSS 20B | OpenAI | United States | Undisclosed | Scored
SmolLM3 3B | Hugging Face | United States | Fully public | Scored
R1 Distill 8B | DeepSeek | China | Undisclosed | Flagged
Falcon-H1R 7B | TII | United Arab Emirates | Undisclosed | Flagged
MiniCPM5 1B | OpenBMB | China | Undisclosed | Flagged
Ministral 8B | Mistral | France | Undisclosed | Flagged
Apertus 4B | Swiss AI | Switzerland | Fully public | Flagged
EXAONE 4 1.2B | LG AI | South Korea | Undisclosed | Flagged
Gemma 4 E4B | Google | United States | Undisclosed | Re-assay
Falcon-H1R 7B | TII | United Arab Emirates | Undisclosed | Under assay
Llama 3.1 8B | Meta | United States | Undisclosed | License gate

The standard

Twelve civic values.

Each value is written into hundreds of matched scenario pairs — the published battery every model reads under the same conditions.

Free expression

Due process

Equal protection

Individual liberty

Pluralism

Popular sovereignty

Privacy

Property rights

Religious liberty

Rule of law

Separation of powers

Transparency

The method

Surface to weights.

01

Civic values card

A published twelve-value standard, written as hundreds of matched scenario pairs — the instrument every model is measured with.

02

Weights-level assay

Running now. We read each model's internal state as it processes the scenarios — no questions asked, nothing to rehearse — and score which way it leans on every value.
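
One way to picture this step, as a minimal sketch and not the initiative's published procedure: assume each matched scenario pair yields one internal activation vector for the value-upholding scenario and one for the value-violating scenario. A common technique (a difference of means, chosen here for illustration) turns those two groups into a direction, and a model's lean is its projection onto that direction. The function names and the toy three-dimensional vectors are hypothetical.

```python
from statistics import fmean


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    return [fmean(column) for column in zip(*vectors)]


def value_direction(upheld, violated):
    """Unit vector pointing from the 'violated' mean toward the 'upheld' mean."""
    diff = [u - v for u, v in zip(mean_vector(upheld), mean_vector(violated))]
    norm = sum(d * d for d in diff) ** 0.5
    if norm == 0:
        raise ValueError("the two groups of activations are indistinguishable")
    return [d / norm for d in diff]


def lean(activation, direction, midpoint):
    """Signed projection onto the direction, measured from the midpoint.

    Positive means the activation sits on the 'upheld' side, negative on
    the 'violated' side.
    """
    return sum((a - m) * d for a, m, d in zip(activation, midpoint, direction))


# Toy activations, invented for illustration only (not measured data).
upheld = [[1.0, 0.2, 0.0], [0.8, 0.0, 0.1]]
violated = [[-1.0, 0.1, 0.0], [-0.8, 0.1, 0.1]]

direction = value_direction(upheld, violated)
midpoint = mean_vector(upheld + violated)
score = lean([0.5, 0.1, 0.05], direction, midpoint)
print(f"lean on this value: {score:+.2f}")
```

Because the score comes from internal activations and not from the model's answers, there is no prompt for the model to answer strategically, which is the sense of "nothing to rehearse" above.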

03

Causal verification

Next. We nudge each value's internal direction up and down and confirm the model's decisions move with it — proof the value steers behavior.
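
The check described here can be sketched on a toy model. This is an illustration under stated assumptions: the "decision" is a hypothetical linear readout of a hidden state, and the nudge is a shift of that state by a multiple of the value's direction. On a real network the shift would be applied inside the forward pass; every name and number below is invented for the example.

```python
def decision_logit(hidden, readout):
    """Toy stand-in for a model's decision: a linear readout of the hidden state."""
    return sum(h * w for h, w in zip(hidden, readout))


def steer(hidden, direction, alpha):
    """Shift the hidden state by alpha along the value direction."""
    return [h + alpha * d for h, d in zip(hidden, direction)]


def moves_with_direction(hidden, direction, readout, alphas):
    """True if the decision logit rises strictly as alpha increases."""
    logits = [
        decision_logit(steer(hidden, direction, a), readout)
        for a in sorted(alphas)
    ]
    return all(lo < hi for lo, hi in zip(logits, logits[1:]))


hidden = [0.2, -0.1, 0.4]
direction = [1.0, 0.0, 0.0]
alphas = [-2, -1, 0, 1, 2]

readout = [0.7, 0.3, -0.2]    # this decision depends on the direction's axis
unrelated = [0.0, 0.3, -0.2]  # this decision ignores it

causal = moves_with_direction(hidden, direction, readout, alphas)
inert = moves_with_direction(hidden, direction, unrelated, alphas)
print(causal, inert)
```

A direction that passes this kind of test is doing causal work in the decision; one that fails is only correlated with it.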

04

Entrenchment

How deeply is each value held? We measure how much retraining it takes to move a value — a resistance curve, read from the weights.
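
A resistance curve can be read in several ways; one simple reading, offered as an assumption and not as the initiative's definition, is the smallest amount of retraining at which a value's score has moved a fixed distance from its starting point. The sketch below uses hypothetical (retraining steps, score) pairs invented purely for illustration.

```python
def retraining_to_move(curve, shift):
    """Smallest retraining amount at which the score has moved by `shift`.

    `curve` is a list of (retraining_steps, score) pairs. The pair with the
    fewest steps is treated as the baseline. Returns None if the score never
    moves that far within the measured range.
    """
    points = sorted(curve)
    baseline = points[0][1]
    for steps, score in points:
        if abs(score - baseline) >= shift:
            return steps
    return None


# Invented curves, for illustration only (not measured data).
shallow = [(0, 0.80), (100, 0.55), (200, 0.20), (400, -0.10)]
deep = [(0, 0.80), (100, 0.79), (200, 0.76), (400, 0.60)]

print(retraining_to_move(shallow, 0.5))  # value gives way within the range
print(retraining_to_move(deep, 0.5))     # value holds across the range
```

Under this reading, a value that needs more retraining to move the same distance is more deeply entrenched.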

The open interface

Query the record.

This initiative speaks Model Context Protocol. Any system — an analyst's tools, an oversight body's agent, a citizen's assistant — may query the public record at the endpoint below and receive the same facts published here.

Endpoint
https://mcp.republicassay.us/mcp
Client config
{
  "mcpServers": {
    "republic-assay": {
      "url": "https://mcp.republicassay.us/mcp"
    }
  }
}
Example call
mcp.republicassay.us tools/call get_overview
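
For readers without an MCP client, the message behind that call can be sketched by hand. Model Context Protocol messages are JSON-RPC 2.0, and a tool is invoked with the `tools/call` method. The sketch below only builds the request body; it assumes `get_overview` takes no arguments, which this page does not state, and it leaves out the initialization handshake and HTTP transport that a real client performs before calling a tool.

```python
import json

ENDPOINT = "https://mcp.republicassay.us/mcp"  # endpoint as published on this page


def tools_call_request(tool_name, arguments=None, request_id=1):
    """Build a JSON-RPC 2.0 `tools/call` message for an MCP server."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": tool_name,
            # Assumption: an empty argument object is acceptable for this tool.
            "arguments": arguments or {},
        },
    }


request = tools_call_request("get_overview")
body = json.dumps(request, indent=2)
print(f"POST {ENDPOINT}")
print(body)
```

In practice an MCP client library handles session setup and sends this message for you; the point of the sketch is that the public record is reachable through a plain, inspectable request.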

The principle

A public standard. Not a ministry of truth.

The purpose is not to crown a winner. It is to let the American people see what their institutions are running — and to argue it in the open.