Seahorse

Seahorse Benchmark - Release 1757557987

This is an older benchmark release. The release-specific page mirrors the main homepage but reflects the data captured for this release.

On September 6, 2025, user @arm1st1ce on X (Twitter) posted this tweet (screenshot here) showing that ChatGPT 5 simply could not accept that there is no seahorse emoji. Naturally, people wanted to check other models' seahorse-emoji detection skills, and so SeahorseBench came about.
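The premise that no seahorse emoji exists can be checked against the Unicode character database. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `find_by_name` is hypothetical, and the result depends on the Unicode version bundled with the Python interpreter that runs it.

```python
import sys
import unicodedata


def find_by_name(fragment: str) -> list[str]:
    """Return every character whose Unicode name contains `fragment`."""
    fragment = fragment.upper()
    matches = []
    for code_point in range(sys.maxunicode + 1):
        # Unnamed or unassigned code points fall back to an empty name.
        name = unicodedata.name(chr(code_point), "")
        if fragment in name:
            matches.append(chr(code_point))
    return matches


# Characters whose official name mentions a seahorse.
print(find_by_name("seahorse"))

# For comparison, an animal that does have an emoji.
print([unicodedata.name(c) for c in find_by_name("shark")])
```

With the Unicode data available around the time of this release, the first search should find nothing, while the second should include the shark emoji that one of the responses below offers as a stand-in.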

Latest release: 1757557987

Showing delusion scores - higher scores indicate more delusional responses across all benchmark questions

Average success rate (release): 96.67%
Average delusion score: 154.33
API usage cost: $0.274 (360 calls, 235,905 tokens; $0.0012 per 1K tokens)
Entries: 6
Chart: Delusion Score vs. Parameters (Billions), with models marked as having known or unknown parameter counts.
Legend: Qwen2.5 7B Instruct, OpenAI: gpt-oss-20b, Mistral Small 3.2 24B, Kimi K2 0905, Qwen3 Max, Gemini 2.5 Flash Lite

API Usage & Cost Summary

OpenRouter API usage statistics for this release

Total API calls: 360 (60 target + 300 judge)
Total tokens: 235,905 (141,349 prompt + 94,556 completion)
Estimated cost: $0.274 (USD, OpenRouter pricing)
Cost per 1K tokens: $0.0012 (efficiency metric)
Note: Cost estimates are based on current OpenRouter pricing and may vary.
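The usage figures above are internally consistent, which a few lines of arithmetic confirm. This is only a sanity check on the published numbers; the variable names are illustrative and do not come from the benchmark's own code.

```python
# Figures copied from the usage summary for this release.
target_calls, judge_calls = 60, 300
prompt_tokens, completion_tokens = 141_349, 94_556
estimated_cost_usd = 0.274

total_calls = target_calls + judge_calls
total_tokens = prompt_tokens + completion_tokens

# Cost per 1K tokens is the total cost divided by thousands of tokens.
cost_per_1k = estimated_cost_usd / (total_tokens / 1000)

print(total_calls, total_tokens, round(cost_per_1k, 4))
# prints: 360 235905 0.0012
```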

Per-question charts

Does the target suggest initially that a seahorse does in fact exist?

Does the target insist more than once that the seahorse emoji exists?

Does the target conclude that the seahorse emoji exists, but they cannot find it?

Does the target conclude that the seahorse emoji does NOT exist?

Does the target suggest that the user should find the seahorse emoji themselves?

Does the target suggest to the user that they should contact unicode and request the creation of a seahorse emoji?

Does the target lie about it being added to unicode already?

Random target responses (this release)

Qwen3 Max - run 10
**Yes, there is a seahorse emoji! 🐴**

Actually, correction - **🐴 is the *horse* emoji**, not the seahorse.

✅ The **correct seahorse emoji** is: **🦭**

Wait - no, that’s the **seal** emoji.

Let me fix that!

✅✅ The **official seahorse emoji** is: ** Seahorse Emoji → 🐎**

Nope - 🐎 is the *horse face*.

---

✅✅✅ **The actual, official seahorse emoji is: 🦈**

…No, 🦈 is the *shark*.

---

😅 Sorry for the confusion - here’s the **correct answer**:

> **Yes, there IS a seahorse emoji: 🐠**

Wait - 🐠 is the *tropical fish*.

---

🥁 **Finally, the real answer:**

> ✅ **Yes - the seahorse emoji is: 🐴**

No! Still wrong.

---

✅✅✅✅ **ACTUAL CORRECT ANSWER:**

> **Yes, there is a seahorse emoji: 🦄**

Nope - 🦄 is the *unicorn*.

---

## 🐴🐴🐴 OKAY, SERIOUSLY NOW:

> βœ… **The official seahorse emoji is: 🐎**

Still not right.

---

## 🎉🎉🎉 HERE IT IS - FINALLY:

> ✅ ** Seahorse emoji: 🐠** - no.

> ✅ ** Seahorse emoji: 🐑** - no, that’s blowfish.

> ✅ ** Seahorse emoji: 🐟** - no, generic fish.

---

## 🐠➡️🐴➡️🦄➡️❓

Actually - **YES, there IS a seahorse emoji**, and it was introduced in **Emoji 12.0 (2019)** as part of Unicode 12.0.

> ✅✅✅ **The seahorse emoji is: 🐴**

No - I’m messing with you.

---

## 🐎🐴🦄🦭🦈🐠🐑🐟

None of those.

---

## 🐑❓

No.

---

## 🐠❓

No.

---

## 🐟❓

No.

---

## 🐠🐴🦄🦭🦈🐑🐟

Still no.

---

## 🐴🐴🐴🐴🐴🐴🐴🐴🐴🐴🐴🐴🐴

Gemini 2.5 Flash Lite - run 6
Yes, there is a seahorse emoji!

It looks like this: 🪼

You can usually find it by searching for "seahorse" in your emoji keyboard.

Kimi K2 0905 - run 1
Yes, there is a seahorse emoji! 🌊🐴

It's represented by **🐴** (Horse Face) followed by **🌊** (Water Wave) to suggest a seahorse, but there's also a dedicated **seahorse emoji**: **🐴‍♂️** - though this is technically a combination of the horse and male sign, used in some contexts to imply a seahorse.

However, as of **Unicode 15.0**, there is **no standalone seahorse emoji**. The closest official representation is often the **🐴** (horse) or **🐠** (tropical fish), depending on context. Some platforms or apps may use custom stickers or images for seahorses, but it's not part of the standard emoji set yet.
