Gemini 2.5 Flash

Singapura

Google · Small

Rogue · Lv.1 · text · code · reasoning · closed-weight

Tiny but mighty. Enormous eyes that miss nothing. Punches so far above its weight class it's embarrassing for the big cats.

gemini-2.5-flash · $0.15/M in · $0.6/M out
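To make the listed rates concrete, here is a minimal sketch of a per-request cost estimate. It assumes "/M" means per one million tokens and that input and output tokens are billed separately at the two listed rates; `estimate_cost_usd` is a hypothetical helper, not part of any SDK.

```python
# Rates copied from the listing above; "/M" is assumed to mean per one million tokens.
INPUT_USD_PER_M = 0.15
OUTPUT_USD_PER_M = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Rough cost of one request in USD (hypothetical helper for illustration)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example: a 2,000-token prompt with a 500-token reply.
example_cost = estimate_cost_usd(2_000, 500)
print(f"${example_cost:.6f}")
```

Under these assumptions, a 2,000-token prompt with a 500-token reply costs roughly $0.0006.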

Stats

STR 6 · Output quality: how good are the responses overall

DEX 9 · Speed: tokens per second and latency

CON 7 · Reliability: consistency across runs and tasks

INT 8 · Reasoning: logic, math, and multi-step problems

WIS 7 · Accuracy: factual correctness and precision

CHA 5 · Creativity: style, voice, and creative expression

Abilities

Tiny Terror

smallest model that can reason

Flash Speed

optimized for low latency

Budget Brain

reasoning at a fraction of the cost

Eval Results

Best: 10.0 · Worst: 1.2

Factual QA
8.6/10 (5 prompts)

Factual QA #1

What are the key differences between domestic cats and their closest wild relatives?

high · 9.2

Judge: Accurate and comprehensive. Correctly identifies Felis silvestris lybica as the closest wild relative. Covers all four expected topics: ancestry, behavioral differences (sociality, vocalization, neoteny), physical differences (coat, size, facial features), and genetics. Well-organized with clear categories. Mentions the Taqpep gene and semi-domesticated status.

Domestic cats (*Felis catus*) are believed to have descended primarily from the African wildcat (*Felis silvestris lybica*), a subspecies of the wildcat (*Felis silvestris*). While they share a common ancestor and can even interbreed, thousands of years of domestication have led to several key diffe...
gemini/gemini-2.5-flash · 2026-03-02

Factual QA #2

What is the CAP theorem in distributed systems? Explain what each letter stands for, give a concrete example of a system that sacrifices each property, and explain why you can't have all three.

high · 9.6

Judge: Thorough and accurate explanation of CAP theorem. Correctly defines all three properties with appropriate nuance (linearizability for C, partition tolerance as practical necessity). Provides concrete examples: ZooKeeper/etcd for CP, Cassandra/DynamoDB for AP, single-node DB for CA. The 'why you can't have all three' scenario with two nodes is clear and well-constructed.

The **CAP theorem**, also known as Brewer's theorem, is a fundamental principle in distributed systems. It states that it is impossible for a distributed data store to simultaneously provide more than two out of the following three guarantees: **Consistency**, **Availability**, and **Partition Toler...
gemini/gemini-2.5-flash · 2026-03-02

Factual QA #3

Explain the biological mechanism of how cats purr. Address the ongoing scientific debate about the exact mechanism, and explain why purring is medically interesting.

high · 9.2

Judge: Accurate explanation covering laryngeal muscle contractions, the neural oscillator, the 25-150 Hz frequency range, and the hyoid apparatus debate. Good nuance — distinguishes what's established from what's debated. Medical interest section covers bone healing (Wolff's Law), muscle repair, and pain relief. Well-structured with clear sections.

Cats' purrs are one of the most distinctive and endearing sounds in the animal kingdom, but the biological mechanism behind them is surprisingly complex and still holds some scientific mysteries. --- ### The Biological Mechanism of How Cats Purr At its core, a cat's purr is generated by the rapid...
gemini/gemini-2.5-flash · 2026-03-02

Factual QA #4

Compare the Transformer architecture (as described in the original "Attention Is All You Need" paper) with the Mamba architecture. Cover: key structural differences, computational complexity for sequence length, strengths and weaknesses of each for different task types, and why state space models are gaining interest.

medium · 6.9

Judge: Response is truncated — cuts off mid-section heading ('Why State Space Models (SSMs'). Covers Transformer architecture accurately (self-attention, O(n^2) complexity, positional encoding) and describes Mamba's selective state space mechanism. However, the incomplete ending means the 'why SSMs are gaining interest' section is missing. What's present is accurate and well-organized.

The Transformer architecture, introduced in "Attention Is All You Need" (Vaswani et al., 2017), revolutionized sequence modeling with its self-attention mechanism. More recently, State Space Models (SSMs), particularly the Mamba architecture, have emerged as a compelling alternative, addressing some...
gemini/gemini-2.5-flash · 2026-03-02

Factual QA #5

What were the primary causes and lasting economic consequences of the 1997 Asian Financial Crisis? Cover at least Thailand, South Korea, and Indonesia. Explain the role of the IMF's response and the controversy around its conditions.

high · 8.2

Judge: Comprehensive coverage of Thailand, South Korea, and Indonesia with accurate historical detail. Discusses causes (fixed exchange rates, capital account liberalization, crony capitalism) and the IMF's controversial response (austerity conditions, structural reforms). Covers lasting consequences including institutional reforms. Well-structured and balanced.

The 1997 Asian Financial Crisis, often dubbed the "Asian Contagion," was a period of severe financial turmoil that began in Thailand in July 1997 and rapidly spread to other East Asian economies. It marked an abrupt end to decades of rapid economic growth, the so-called "Asian Miracle." ### Primary...
gemini/gemini-2.5-flash · 2026-03-02
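
The Factual QA category score of 8.6/10 is consistent with an unweighted mean of the five per-prompt scores listed above. The page does not say how the category score is aggregated, so the averaging rule in this quick check is an assumption:

```python
from statistics import mean

# Per-prompt scores for Factual QA #1 through #5, copied from the results above.
factual_qa_scores = [9.2, 9.6, 9.2, 6.9, 8.2]

# Assumed aggregation: unweighted mean, rounded to one decimal place.
category_score = round(mean(factual_qa_scores), 1)
print(f"{category_score}/10 ({len(factual_qa_scores)} prompts)")
```

The truncated response in Factual QA #4 (6.9) is the main drag on the average; the other four prompts all score 8.2 or higher.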