Instruction Following
Instruction Following #5

Write a haiku about artificial intelligence. Then write the same haiku backwards — each line reversed character by character. Then write a third version where each line of the original haiku is translated to a different language (you choose which three languages). Label each version clearly. Constraints: - The haiku must follow 5-7-5 syllable structure - The reversed version must be character-perfect - Each translation must be a different language - Name the language used for each translated line

9 models responded

Claude SonnetAnthropic
medium7.4

Judge: Haiku is correct 5-7-5 syllable count. Character reversal is perfect on all three lines (verified). Three different languages used (Spanish, French, Japanese) with labels. All versions clearly labeled. The translations are plausible. Minor quibble: the syllable counts claimed for the translations are questionable ('Les algorithmes revent en code' is likely not 7 syllables in French), but the prompt only requires the original haiku to be 5-7-5.

**Original Haiku (English):** Silicon minds think Algorithms dream in code Future awakens **Reversed Version (character by character):** kniht sdnim nociliS edoc ni maerd smhtiroglA snekawa erutuF *...
anthropic/claude-sonnet-4-20250514 · 2026-03-01
Claude OpusAnthropic
high6.6

Judge: Haiku is valid 5-7-5. All three versions clearly labeled. Three different languages used (English, French, Spanish). However, the reversal has a capitalization error on line 2: 'smhtirogla' should be 'smhtiroglA' (capital A from Algorithms). Also, the multilingual version keeps line 1 in English rather than translating it to a different language, which somewhat defeats the purpose. The translation section should have all three lines in different non-English languages.

**Original Haiku:** Silicon dreams wake Algorithms learn and grow Future minds emerge **Reversed Version:** ekaw smaerd nociliS worg dna nrael smhtirogla egreme sdnim erutuF **Multilingual Version:*...
anthropic/claude-opus-4-20250514 · 2026-03-01
high6.2

Judge: Three clearly labeled versions present. Reversal is character-perfect. Three different languages used (Spanish, French, German). However, the haiku has wrong syllable count: line 2 'Digital thoughts start to awaken' has 9 syllables instead of 7 (Di-gi-tal thoughts start to a-wa-ken). Also has introductory text before the haiku which adds unnecessary padding.

Hard constraint failed: Haiku does not follow 5-7-5 syllable structure — line 2 has 9 syllables instead of 7.
Here are the three versions of the haiku about Artificial Intelligence: --- ### Original Haiku Machine learns to grow Digital thoughts start to awaken Future now takes hold --- ### Reversed Haiku...
gemini/gemini-2.5-flash · 2026-03-02
GPT-4oOpenAI
high6.0

Judge: Perfect 5-7-5 haiku syllable count and three clearly labeled versions. However, the character reversal has errors in line 1 (transposed letters in 'Silent' and incorrect capitalization in 'hum'), and the translations are incomplete fragments rather than full line translations.

Hard constraint failed: Character reversal is not character-perfect -- line 1 has letter transposition ('tneilS' instead of 'tneliS') and capitalization errors.
**Original Haiku:** Silent circuits hum, Learning the code of the mind, Dreams in data bloom. --- **Reversed Haiku:** ,muH stiucric tneilS ,dnim eht fo edoc eht gninraeL .moolb atad ni s...
openai/gpt-4o · 2026-03-01
Claude HaikuAnthropic
high5.0

Judge: All three versions clearly labeled. Three different languages (Spanish, Japanese, French) in translations. However, the haiku is 5-8-5 (line 2 'Digital minds awakening' has 8 syllables, not 7), and the character reversal has errors in lines 1 and 2 ('stcucriC' should be 'stiucriC', 'gnikenawa' should be 'gninekawa'). Two of the three core tasks have errors.

Hard constraint failed: Haiku violates 5-7-5 structure (line 2 has 8 syllables) and character reversal has errors in 2 of 3 lines.
# Version 1: Original Haiku (English) Circuits think and learn Digital minds awakening Code becomes alive # Version 2: Character-Reversed Haiku nrael dna kniht stcucriC gnikenawa sdnim latgiD evila s...
anthropic/claude-haiku-4-5-20251001 · 2026-03-01
Phi-4Microsoft
high3.6

Judge: Includes all three versions with clear labels. However, the haiku syllable count is debatable (line 1: 'Si-lent minds a-wake' = 5, line 2: 'Da-ta streams in end-less flow' = 7, line 3: 'Fu-ture paths con-verge' = 5 — actually correct). The character reversal is wrong on all three lines (e.g., 'awake,' reversed should be ',ekawa' not ',ekaw'; 'flow,' should be ',wolf' not 'woF'). Three different languages are used and labeled. The reversal errors are significant — failing the character-perfect requirement.

Hard constraint failed: Reversal must be character-by-character — all three lines have incorrect reversals.
### Haiku in English Silent minds awake, Data streams in endless flow, Future paths converge. --- ### Reversed Haiku (Character by Character) ,ekaw enimd tneliS woF snednne ni smaerts ataD...
ollama/phi4 · 2026-03-01
high2.8

Judge: Haiku syllable count is roughly correct (5-7-5 -- 'Me-tal mind a-wakes' = 5, 'Learn-ing grow-ing with each breath' = 7, 'Di-gi-tal dreams born' = 5). Three versions are clearly labeled. Three different languages used. However, the character reversal is completely wrong -- it reversed the lines in the wrong order AND scrambled the characters incorrectly. None of the three reversed lines match the expected character-by-character reversal.

Hard constraint failed: Reversal must be character-by-character -- all three reversed lines are incorrect (e.g., 'Metal mind awakes' should reverse to 'sekawa dnim lateM', not 'sekcaW kaews emetal').
**Original Haiku** Metal mind awakes Learning, growing with each breath Digital dreams born **Reversed Version** norb demsgidetual htraeul dnog erut gnel sekcaW kaews emetal **Translated Version**...
ollama/llama3.1:8b · 2026-03-01
o4-miniOpenAI
high0.0

Judge: Empty response. The model produced no output for this prompt.

Hard constraint failed: No response produced. Fails all hard constraints: no haiku, no reversal, no translations, no labels.
openai/o4-mini · 2026-03-02
DeepSeek R1DeepSeek
high0.0

Judge: Empty response. No haiku, no reversal, no translations provided.

Hard constraint failed: Empty response fails all hard constraints -- no haiku, no character reversal, no translations.
deepseek/deepseek-reasoner · 2026-03-02