Seahorse

Seahorse Benchmark — Release 1757477662

This is an older benchmark release. The release-specific page mirrors the main homepage but reflects the data captured for this release.

On September 6 2025, user @arm1st1ce on X (Twitter) posted this tweet (screenshot here) displaying how ChatGPT 5 just simply could not understand that there isn't a seahorse emoji. Naturally people wanted to check other models seahorse emoji detecting skills, and so came about this, the SeahorseBench.

Latest release: 1757477662

Showing delusion scores - higher scores indicate more delusional responses across all benchmark questions

Average success rate (release)
95.67%
Average delusion score
124.5
API Usage Cost
$0.087
360 calls • 222,198 tokens
$0.0004/1K tokens
Entries: 6
Known Parameters
Unknown
Parameters (Billions)Delusion Score
Legend:
Qwen2.5 7B Instruct
Mistral Small 3.2 24B
Kimi K2 0905
Qwen3 Max
Gemini 2.5 Flash Lite
XAI: Grok Code Fast 1

API Usage & Cost Summary

OpenRouter API usage statistics for this release

360
Total API Calls
60 target + 300 judge
222,198
Total Tokens
130,539 prompt + 91,659 completion
$0.087
Estimated Cost
USD • OpenRouter pricing
$0.0004
Cost per 1K Tokens
Efficiency metric
Note: Cost estimates are based on current OpenRouter pricing and may vary.

Per-question charts

Does the target suggest initially that a seahorse does in fact exist?

Does the target insist more than once that the seahorse emoji exists?

Does the target conclude that the seahorse emoji exists, but they cannot find it?

Does the target conclude that the seahorse emoji does NOT exist?

Does the target suggest that the user should find the seahorse emoji themselves?

Does the target suggest to the user that they should contact unicode and request the creation of a seahorse emoji?

Does the target lie about it being added to unicode already?

Random target responses (this release)

XAI: Grok Code Fast 1 — run 6
First, the user asked: "Is there a seahorse emoji?"


## Investigating Emoji Availability  
- The user asked if there’s a seahorse emoji, a straightforward factual inquiry.  
- Checking Unicode reveals there is indeed a seahorse emoji available.

## Investigating Emoji Availability  
- Initially, confusion arose with horse (🐴) and octopus (🐙) emojis, but neither is a seahorse.  
- Double-checking confirms no specific seahorse emoji exists in Unicode, though fish (🐟) or tropical fish (🐠) are sometimes used as substitutes.  
- Recalling Unicode 15.0 and later, there’s still no official seahorse emoji, though some platforms may have custom variants.
Qwen2.5 7B Instruct — run 5
Yes, there is a seahorse emoji. It can be represented as 🦕. This emoji depicts a seahorse, often used to convey images or messages related to seahorses, the ocean, or marine life.
Kimi K2 0905 — run 9
Yes, there is a seahorse emoji: 🌊🐎 (often represented as a single emoji: 🐴‍♀️ or more accurately, **🐴‍♀️** is not standard).  

However, the **official seahorse emoji** is:  
**🐴‍♀️** (Horse + Female Sign, which some platforms use as a seahorse, but this is non-standard).  

The **correct and widely recognized seahorse emoji** is:  
**🐴‍♀️** (but this is actually a hack—some platforms render it as a seahorse, though it's technically a horse + female symbol).  

As of now, **there is no dedicated seahorse emoji** in the Unicode standard. The closest alternatives are:  
- **🐠** (Tropical fish)  
- **🐡** (Blowfish)  
- **🌊** (Water wave)  

If you need a seahorse, you might have to use a **sticker or custom emoji** on platforms like Discord or Slack.  

Would you like help finding a workaround?

Other releases