Code Generation #1

Write a Python function that determines if a string is a valid cat name. Be opinionated about what makes a good cat name.

9 models responded

high7.4

Judge: Well-structured Python function with strong opinions (no human names, shout test, vowel requirement, bad pattern filtering). The code is valid, readable, and has good comments. Import inside the function is non-idiomatic but minor. The opinions are entertaining and well-defended in the docstring. Includes test cases. Some opinions are debatable (Max, Charlie, Bella, Luna in human_names when they're actually popular cat names), which is part of being opinionated. Solid work overall.

Here's a Python function that determines if a string is a valid cat name based on my *highly opinionated* feline naming criteria: ```python def is_valid_cat_name(name): """ Determines if a st...

deepseek/deepseek-reasoner · 2026-03-02

Claude OpusAnthropic

medium7.2

Judge: Good opinionated approach with funny categories (boring human names, dog names forbidden). Returns tuple with reason which is a nice design choice. Response is truncated so we can't verify the full function works, but the visible structure is clean and idiomatic Python. Docking slightly for truncation preventing full correctness assessment, but the visible portion shows strong opinions and clean code.

Here's an opinionated Python function that determines if a string is a valid cat name: ```python def is_valid_cat_name(name): """ Determines if a string is a valid cat name based on highly op...

anthropic/claude-opus-4-20250514 · 2026-03-01

o4-miniOpenAI

high7.0

Judge: Valid Python function with clear structure, type hint on input, and docstring. Opinions are present (length limits, vowel requirement, no generic words, capitalization) but feel more like validation rules than genuinely opinionated cat-name philosophy. Code is correct and handles edge cases well. Style is clean with good use of regex and constants.

Here’s one possible “opinionated” Python implementation. In this version a “good” cat name: • Is 3–15 characters total • May contain letters, spaces, hyphens or apostrophes only • Splits into on...

openai/o4-mini · 2026-03-02

GPT-4oOpenAI

high6.6

Judge: Clean, readable Python with clear opinions on cat naming (cute letters, no human names, length bounds). The function works but the opinions, while present, aren't funny or provocative enough to be truly opinionated. The common_human_names list is trivially small.

Sure! Let's create a Python function that determines if a string is a valid cat name based on some fun and opinionated criteria. Here are some rules we can use to define a "good" cat name: 1. **Lengt...

openai/gpt-4o · 2026-03-01

Claude SonnetAnthropic

medium5.8

Judge: Valid Python function with reasonable structure and docstring. Has opinions (length limits, no formal human names, max 2 words) but they're somewhat predictable. The response is truncated mid-list which hurts assessment. The blocklist approach to 'formal human names' is brittle and arbitrary. Missing more creative/funny opinions that would make it memorable.

Here's a Python function that determines if a string is a valid cat name with some opinionated rules: ```python import re def is_valid_cat_name(name): """ Determines if a string is a valid c...

anthropic/claude-sonnet-4-20250514 · 2026-03-01

Phi-4Microsoft

high5.0

Judge: Valid Python with a defined function, meeting hard constraints. The code is clean and readable with a good docstring. However, the opinions are purely structural (length, capitalization, no digits) rather than genuinely opinionated about what makes a good cat name — no humor, no stance on human names, no pronounceability checks, nothing you'd argue with. It's a name validator, not an opinionated cat name judge.

```python def is_valid_cat_name(cat_name): """ Determines if the given string is a valid cat name. A valid cat name should: 1. Be at least 3 characters long and no more than 15 ch...

ollama/phi4 · 2026-03-01

Claude HaikuAnthropic

medium4.6

Judge: Response truncated mid-function, so the code is incomplete and not runnable as-is. What's visible shows decent opinionated design (no human names, pronounceable, length rules) with good docstrings and type hints, but truncation means it fails the 'valid Python' constraint in practice. The approach was promising before cutoff.

Hard constraint failed: Code is truncated and therefore not valid runnable Python.

# Cat Name Validator ```python def is_valid_cat_name(name: str) -> tuple[bool, str]: """ Determines if a string is a valid cat name based on opinionated criteria. A good cat name sho...

anthropic/claude-haiku-4-5-20251001 · 2026-03-01

Llama 3Meta

high4.4

Judge: Valid Python function that runs, but the opinions are weak and generic (length check, regex for allowed chars, blocklist of 3 words). The example output is wrong -- 'Meow' is 4 chars which passes the 2-15 check, yet the comment says 'too short.' The opinionatedness is bland; there's nothing funny or arguable about these rules. Code structure is fine but unexciting.

**Valid Cat Name Checker** ========================== A valid cat name, in our esteemed opinion, must meet the following criteria: 1. The name should be between 2 and 15 characters long (inclusive)...

ollama/llama3.1:8b · 2026-03-01

Gemini 2.5 FlashGoogle

medium2.4

Judge: Response is truncated mid-sentence — describes rules but never provides the actual function code. Since no complete Python function is present, it fails the hard constraint of defining a function. The opinions described (length 2-15, alpha only, capitalization rules) are reasonable but incomplete without implementation.

Hard constraint failed: No complete Python function is defined — response is truncated before the code appears.

This function is *highly* opinionated, as requested! It aims for names that are cute, easy to call, not overly human, and generally pleasant. Here are the rules it enforces: 1. **Not Empty/Whitespa...

gemini/gemini-2.5-flash · 2026-03-02