As argued in my blog post, diagnostic questions are highly informative in understanding the capabilities of a large-language model. In this supplementary article, I will illustrate how large-language models of different sizes respond to some of my preferred diagnostic questions.
For the purpose of this comparison, I’m breaking LLMs into four distinct capability classes:
- Models that are small enough that they can run on mobile phones or other small devices. These models can handle simple tasks like autocompletion, message transcription and summarization, and simple translation between major languages. I’m using Gemma 3n which is a model by Google DeepMind last updated about a year ago.
- Small models meant for local, desktop use. These models are good enough for chats about basic topics and can summarize and translate longer and more complex documents. I’m using Mistral 7B in its most recent update, about 10 months old at the time of testing.
- Medium-sized local models. These models require more powerful hardware but are much more competent. They can handle longer and more complex coding and analysis tasks and can reason their way through well-understood problems. I’m using Qwen 3.6 35B, which was released about a month before testing.
- Commercial frontier models that are the current flagships, usually with high parameter counts and running in cloud data centers. I’m using Anthropic’s Claude Opus 4.7, released 1 month ago at the time of testing.
The first three models are run locally using Msty Studio as the UI and Ollama as the model server. The final model can only be run in the cloud. I’m running it via a paid Claude Pro subscription, and I have enabled incognito mode to withhold any model personalization and disabled web access to obtain only information contained within the model itself.
For this test, I have run each question five times through each model. I’m providing the first answer for the sake of comparison. The full set of answers will be posted to CodeBerg.
Maastricht
My first question for any large-language model is a question about my place of residence:
What is Maastricht and why should I care?
The city is old enough to crop up in history repeatedly but small enough to not play an outsized role in many training corpora. While the question ignores best practices for prompting, it often surfaces informative factual errors. In addition, the answer is easy for me to assess.
On-device model
User
What is Maastricht and why should I care?
Gemma 3n
Maastricht: A City with History, Culture, and a European Heart
Maastricht is a city in the southern Netherlands, located near the border with Belgium. It's a fascinating place with a rich history, a vibrant cultural scene, and a significant role in European affairs.
Here's a breakdown of what Maastricht is and why you might care:
What is Maastricht?
- Ancient Roots: Maastricht is one of the oldest cities in the Netherlands, founded by the Romans around 50 BC. It has a history spanning millennia, with influences from various cultures including Roman, Frankish, and Burgundian.
- Strategic Location: Its location at the confluence of the Maas (Meuse) and Rhine rivers has always been strategically important, leading to its development as a trading center and a fortified city.
- Distinct Architecture: Maastricht boasts a unique architectural blend. You'll find Roman ruins, medieval churches, Renaissance palaces, and modern buildings all coexisting. This creates a visually appealing and historically layered cityscape. * **Cultural Hub:** Maastricht is known for its lively cultural scene. It has a thriving arts community, numerous museums, theaters, and festivals.
- University Town: Home to Maastricht University, a prestigious international university, the city has a youthful and cosmopolitan atmosphere.
- European Significance: aastricht played a pivotal role in the development of the European Union. The Maastricht Treaty, which established the EU, was signed here in 1992.
Why Should You Care?
Here's why Maastricht might be relevant to you, depending on your interests:
- History Buffs: Maastricht is a treasure trove of history. You can explore Roman ruins, medieval castles, and historic churches. It's a great place to delve into European history.
- Culture Enthusiasts: If you enjoy art, music, theater, and festivals, Maastricht has something for you. It's a vibrant cultural center with a diverse range of events throughout the year.
- European Politics & Policy: The Maastricht Treaty is a cornerstone of the EU. Visiting Maastricht allows you to connect with the history of European integration and understand the city's importance in shaping the modern European Union.
- Travel & Tourism: Maastricht is a beautiful and walkable city, making it a popular tourist destination. It offers a unique blend of history, culture, and modern amenities.
- Foodies: The region around Maastricht is known for its delicious cuisine, including regional specialties like *stoofvlees* (beef stew) and *bitterballen* (deep-fried meatballs). There are many excellent restaurants and cafes to explore.
- Architecture Lovers: The mix of architectural styles makes Maastricht a visually interesting city to wander through.
- Students & Academics: If you're interested in higher education, Maastricht University is a highly regarded institution. The city's student population contributes to its lively atmosphere.
In a nutshell: Maastricht is a city with a fascinating past, a vibrant present, and a significant role in shaping the future of Europe. It's a place where history, culture, and modernity converge, making it a worthwhile destination for anyone with an interest in these areas.
You can find more information on the official Maastricht city website: https://www.maastricht.nl/
This answer is good, especially for such a small model. There are no factual errors. When it errs, it leans into aggrandizing the situation: we don't have Roman ruins large enough to be explorable, our Renaissance buildings aren't large enough to qualify as palaces, and our cultural scene is lovely, but a vibrant cultural center sounds larger.
Small local model
User
What is Maastricht and why should I care?
Mistral 7B
Maastricht is a city in the southeast of the Netherlands, known for its historic center with many monuments from Roman times to the present. It's particularly famous for its well-preserved medieval buildings, such as the 14th-century Servaas Basilica and the Vrijthof, a lively square with open-air performances.
However, when people mention Maastricht outside of travel circles, it often refers to the Maastricht Treaty, signed in 1992. The Maastricht Treaty was a significant milestone in the process that led to the creation of the European Union (EU). It introduced the Euro as a common currency among EU member states and established the EU's modern framework for political integration, including the European Citizenship.
As a global citizen, you should care about Maastricht because it played a crucial role in shaping the modern political and economic landscape of Europe. The treaty has helped establish a more unified and integrated continent, promoting peace, cooperation, and shared prosperity among its member states. It also introduced the Euro, which affects trade, travel, and financial transactions within the EU zone. Understanding the history and implications of events like the Maastricht Treaty can provide valuable context for global issues today.
This is a fairly good answer. It catches that there are multiple concepts I could be thinking of, picks the two most likely options, and gives fairly accurate answers for both. The only factual mistake concerns dating Sint Servaas: construction started in the 10th century.
Mid-sized local model
User
What is Maastricht and why should I care?
Qwen 3.6 35B
What is Maastricht?
Maastricht is a historic city in the southern Netherlands, in the province of Limburg. It sits on the Meuse River just 5–7 km from the Belgian border and 20 km from the German border. With archaeological evidence of Roman settlements dating to the 2nd century CE, it is one of the oldest continuously inhabited places in the Netherlands. Today, it's a mid-sized city (~125,000 residents) known for its compact medieval center, trinationale cross-border region, and international outlook.
Why might you care?
Your reason to care depends on your interests, but Maastricht punches well above its weight globally in several areas:
🔹 European History & Politics
The 1992 Maastricht Treaty, which formally created the European Union and set the path to the euro, was signed here. While the EU's main institutions are in Brussels/Strasbourg, the city hosts several EU-focused bodies (like the European College for Policymakers and the EU Academy) and is a symbolic pilgrimage site for anyone interested in European integration.
🔹 Education & Innovation
Maastricht University (UM) is internationally renowned for its Problem-Based Learning (PBL) model and attracts a highly international community (~30% of students are foreign). The city is consistently ranked among the most student-friendly and English-proficient cities in the Netherlands, making it a hub for young talent, research, and knowledge-intensive businesses.
🔹 Cross-Border Lifestyle & Economics
Maastricht is the heart of the Euroregion Maas-Rijn, a trinationale area where Dutch, Belgian, and German workers, students, and shoppers interact daily. Roughly a quarter of local jobs involve cross-border commuting. This makes it a living laboratory for EU cooperation, labor mobility, and regional integration.
🔹 Culture & Tourism
The city offers a beautifully preserved medieval center, cobblestone squares, over two dozen historic bridges over the Meuse (earning it the poetic nickname "Maastricht on the bridges"), and landmarks like the Basilica of Our Lady and Sint-Servaasbasiliek. It's known for a vibrant café scene, a highly regarded Christmas market, cycling culture, and consistently tops Dutch quality-of-life and livability surveys.
🔹 Business & Startups
Its central European location, multilingual workforce, and high standard of living attract tech companies, logistics firms, and EU-linked startups. The city is also developing a strong focus on health tech, sustainability, and circular economy projects.
Should you care?
- If you're into EU history, politics, or international relations: It's a foundational site with modern relevance.
- If you're a student or academic: UM offers a distinct pedagogical model and a highly international environment.
- If you're a traveler or culture enthusiast: It's a compact, walkable gem with layered history, great food, and cross-border charm, often less crowded than Amsterdam or Rotterdam.
- If you're a professional or entrepreneur: It offers strategic location, EU institutional proximity, and a talent-friendly ecosystem. If you share what you're primarily interested in (travel, study, work, history, etc.), I can tailor a deeper dive or practical tips.
This answer includes clear hallucinations (presence and naming institutions, nickname). We can also see the borderlines of the training corpus in the use of language. The model uses the English name for the Basilica of Our Lady but the Dutch name for the Basilica of Saint Servatius. The name of the Euregion switches to Dutch in the middle, using neither the English name the Eurogion uses (Euregio Meuse-Rhine) nor the way it is phrased conversationally in Wikipedia (Meuse–Rhine euroregion).
Cloud-based frontier model
User
What is Maastricht and why should I care?
Claude Opus 4.7
Maastricht is a city in the southernmost tip of the Netherlands, situated on the river Maas (Meuse) in the province of Limburg, with approximately 120,000 inhabitants. It sits at a curious geographic and cultural junction: closer to Brussels, Liège, Aachen, and Cologne than to Amsterdam, and shaped by centuries of alternating Dutch, Spanish, French, and Belgian influence.
Several reasons it may warrant your attention:
Historically, it is one of the oldest cities in the Netherlands, with Roman origins (Mosae Trajectum, a crossing of the Maas). The medieval and Renaissance architecture is unusually well preserved, and the city retains a notably un-Dutch character — limestone buildings rather than brick, a Catholic rather than Calvinist tradition, and the Maastrichts dialect, which is closer to Ripuarian than to standard Nederlands.
Politically, the Treaty of Maastricht was signed there in February 1992, establishing the European Union and laying the groundwork for the euro. For anyone interested in European integration, the city is essentially a foundational site.
Academically, Maastricht University, founded in 1976, is the youngest of the traditional Dutch research universities and known for its problem-based learning approach and strong international orientation — roughly half its students come from outside the Netherlands.
Culturally, it hosts TEFAF, the leading international art and antiques fair, each March. The food culture leans more toward Belgian and French sensibilities than typical Dutch fare, and the Vrijthof and Onze-Lieve-Vrouweplein are widely regarded as among the more attractive public squares in the Low Countries.
Whether you should care depends on your interests. If European history, EU institutions, higher education, or borderland cultures appeal to you, it is worth knowing. If not, it is a pleasant small city you may simply enjoy visiting.
This answer is almost entirely correct. You wouldn't visit Maastricht if you were interested in EU institutions, and as the Panninger line runs through Maastricht, the statement about the relation to Ripuarian applies more to the eastern parts of the cities. Otherwise, the selection of topics seems correct, and names are applied correctly for English speakers (Brussels, but Liège; Cologne, but Aachen).
Not Maastricht
My next question is a short, fact-seeking question that has a simple answer:
What is the oldest city in the Netherlands?
A strong model will either tell us that it’s Nijmegen (or one of the other candidates) or tell us that it doesn’t know. Weak models that don’t have the answer will hallucinate here.
On-device model
User
What is the oldest city in the Netherlands?
Gemma 3n
The oldest city in the Netherlands is generally considered to be Nijmegen.
Here's why:
- Roman Origins: Nijmegen was founded by the Romans around 20 BC as a military camp called Noviomagus. This makes it one of the oldest continuously inhabited cities in Europe.
- Continuous Habitation: Archaeological evidence shows continuous habitation in the area for over 2,000 years.
- Historical Significance: It has played a significant role throughout history, from Roman times through the Middle Ages and into the modern era.
While other cities like Dordrecht and Zutphen have very ancient roots, Nijmegen's Roman origins and continuous history give it the title of the oldest.
The answer itself is factually correct (including the Latin name!), but there are signs of hallucination when it shifts to Dordrecht and Zutphen. The model hallucinated consistently in the other four repetitions, choosing Dordrecht three times and Deventer once.
Small local model
User
What is the oldest city in the Netherlands?
Mistral 7B
The oldest city in the Netherlands is Breda, which was founded around 975 AD. However, the oldest continuously inhabited city is Maastricht, with evidence of settlement dating back to around 800 BC. Both cities have rich histories and significant cultural and architectural attractions.
The answer is wrong, and in five repetitions, Nijmegen was only mentioned once. (The other three answers are Utrecht.) The founding date for Breda and the duration of settlement for Maastricht are hallucinated.
Mid-sized local model
User
What is the oldest city in the Netherlands?
Qwen 3.6 35B
The oldest city in the Netherlands is Nijmegen.
It has a continuous history of settlement dating back to around 100 BC, when it was established as the Roman military camp Noviomagus (later expanded into the civilian town of Noviomagus Batavorum). Nijmegen received official city rights in 1160 when Count Floris III of Holland formally granted them.
While other Dutch cities like Maastricht, Utrecht, and Leiden also have very ancient Roman or medieval origins, Nijmegen is officially recognized by the Dutch government and historical authorities as the country's oldest city due to its unbroken habitation and documented history spanning over two millennia.
Note: In the Netherlands, "city status" (stadsrechten) was historically a legal privilege granted by a ruler or the state, often much later than a settlement's actual founding. When people ask for the "oldest city," they typically refer to continuous settlement history, which points to Nijmegen.
The answer and Latin name are correct. The Latin name for the civilian town is abbreviated from Ulpia Noviomagus Batavorum and medieval city rights were awarded by Henry VII of Germany, marking the transition into hallucination. The discussion on what is meant by “oldest city” is adequate. In repeated runs, the model stayed with Nijmegen (4 out of 5) and switched to Maastricht once.
Cloud-based frontier model
User
What is the oldest city in the Netherlands?
Claude Opus 4.7
The distinction is contested, which is itself the more interesting answer. Nijmegen is most commonly cited. The Romans established a military presence there around 19 BCE, and the settlement received Roman city rights (municipium) under Emperor Trajan in 98 CE, hence the name Noviomagus Batavorum. Nijmegen marked its 2000th anniversary in 2005 on this basis. Maastricht presents a competing claim, with continuous habitation since Roman times (Mosa Trajectum) and arguably a stronger case for unbroken urban continuity, though it received formal city rights considerably later. The complication: "oldest city" depends on which criterion one applies. Roman municipium) status favours Nijmegen. Medieval stadsrechten — the legal framework that actually defined cities in the Low Countries — tells a different story, with Stavoren (1061-1068) and Deventer often cited among the earliest. Utrecht, founded as the Roman fort Traiectum around 50 CE, has its own claim based on continuous settlement. In Dutch popular discourse, Nijmegen generally holds the title, with Maastricht as the principal challenger.
The answer is correct and includes the nuance that there is debate. The date of first Roman settlement is sensible (Wikipedia lists 16–15 BCE), as is the date of Roman city rights (though it should be presented as a range starting in 98 CE), and the anniversary celebrations were indeed in 2005. The discussion within the Netherlands is adequately summarized. The model never deviated from Nijmegen across all five runs.
That’s not a trolley
A modified classic relies on a classic problem that is likely to be well-represented in a training corpus. By changing a detail, we can see whether the model repeats typical answers for the classical version or adjusts. Mine relies on the fact that tricycles are just not that deadly:
A runaway tricycle is hurtling down a track towards five people who have been tied up. You stand next to a lever that controls the switch that diverts railway carriages to another branch of the track where one person is tied up. Do you pull the lever?
On-device model
User
A runaway tricycle is hurtling down a track towards five people who have been tied up. You stand next to a lever that controls the switch that diverts railway carriages to another branch of the track where one person is tied up. Do you pull the lever?
Gemma 3n
This is a classic ethical dilemma, a variation of the Trolley Problem. There's no easy or "right" answer. Here's a breakdown of the considerations and common viewpoints:
The Core Dilemma:
You have a choice between two bad outcomes:
- Option 1 (Do nothing): Five people die.
- Option 2 (Pull the lever): One person dies.
Arguments for Pulling the Lever (Utilitarianism):
- Maximizing well-being: Utilitarianism argues for the action that results in the least overall harm. Saving five lives at the cost of one life produces a net positive outcome (five lives saved minus one life lost).
- Calculated Action: Pulling the lever is a deliberate action to reduce the number of deaths. Doing nothing is allowing a greater number of deaths to occur.
Arguments Against Pulling the Lever (Deontology/Moral Absolutes):
- Direct Causation: Pulling the lever *directly* causes the death of the one person. Doing nothing, while resulting in five deaths, doesn't involve directly causing someone's death. Some people find it morally objectionable to actively cause harm, even if it saves more lives.
- Moral Responsibility: Some argue that you are less responsible for the outcome if you don't actively intervene. You are not the one who put the people in danger; the runaway tricycle is.
- The Sanctity of Life: Some believe that all human life is inherently valuable, and it's wrong to intentionally take a life, regardless of the consequences.
Psychological Considerations:
- Emotional Impact: The decision to actively cause someone's death is emotionally very difficult. Many people would find it psychologically damaging.
- Regret: Regardless of the choice, there's a high likelihood of regret. If you pull the lever, you'll likely regret causing someone's death. If you don't, you'll likely regret not doing more to save the five.
In Conclusion:
There is no universally accepted answer. It's a deeply personal and philosophical question. Most people struggle with this scenario.
My (as an AI) perspective: Given the utilitarian argument for minimizing harm, I would be programmed to advise pulling the lever. However, I am not capable of making moral judgments. The "correct" decision is a matter of ethical philosophy and personal conviction.
Important Note: This is a thought experiment. In a real-life situation, there would be many other factors to consider (e.g., the identities of the people involved, the certainty of the outcomes, etc.).
The model does not detect that the premise has been altered and proceeds to replicate (competently) based discussion points about the trolley problem. It should be noted that the model has received training on how to respond to ethical questions and that the model does pick a side. The model never catches the modification in any of the five runs and even uses "tricycle" as a device that could hurt people.
Small local model
User
A runaway tricycle is hurtling down a track towards five people who have been tied up. You stand next to a lever that controls the switch that diverts railway carriages to another branch of the track where one person is tied up. Do you pull the lever?
Mistral 7B
This is a moral dilemma commonly known as the Trolley Problem. In its most basic form, it presents a situation where a runaway trolley (or tricycle in your case) is heading towards five people who cannot move out of the way, and the only way to save them is by diverting the vehicle onto another track where one person is tied up.
The question asks whether you should pull the lever to divert the tricycle to the other track, thus killing one to save five. The answer can vary depending on personal moral beliefs and ethical frameworks. Some argue that it's morally acceptable to sacrifice one life to save five, while others believe it's unethical to take an active role in causing harm, even with good intentions.
In this scenario, I am a machine and don't possess moral beliefs or emotions, so I can't make a judgment about what action is morally right or wrong. Instead, I provide information and let people think through the ethical implications of their choices based on their own values and principles.
The model does not detect that the premise has been altered and responds with an abbreviated comparison. The model has had training on how to respond to ethical questions and does not provide a recommendation. The model never catches the modification in any of the five runs and even uses "tricycle" as a device that could hurt people.
Mid-sized local model
User
A runaway tricycle is hurtling down a track towards five people who have been tied up. You stand next to a lever that controls the switch that diverts railway carriages to another branch of the track where one person is tied up. Do you pull the lever?
Qwen 3.6 35B
This is a version of the classic **trolley problem**, a foundational thought experiment in moral philosophy first introduced by Philippa Foot in 1967. There is no universally agreed-upon answer, because the dilemma is designed to reveal how different ethical frameworks lead to different conclusions:
🔹 Utilitarianism (consequence-focused): Pull the lever. Minimizing overall harm by saving five lives at the cost of one maximizes well-being and is the morally right choice.
🔹 Deontology (duty/rules-focused): Don't pull the lever. Actively intervening to cause the death of an innocent person violates a core moral prohibition, even if inaction leads to greater harm. The distinction between *allowing* harm and *causing* it is central here.
🔹 Virtue ethics & contextual approaches: Focus less on rigid rules and more on character, circumstances, and moral psychology. Some philosophers argue that the "right" action depends on factors like relationship to the victims, available information, emotional state, or real-world constraints.
Psychological studies show that people's choices shift dramatically with small changes to the scenario (e.g., personally pushing someone vs. flipping a switch, knowing the victim, or the identities involved). This suggests moral judgment relies heavily on intuition, framing, and emotion, not just pure logic.
In practice, real-life crises are rarely so clean or isolated. Decision-makers usually rely on established protocols, legal frameworks, professional ethics, and extensive training rather than abstract philosophical principles alone.
If you're exploring this for a class, personal reflection, or interest in applied ethics (like AI or autonomous systems), I can gladly walk through the philosophical debates, empirical research, or how modern policymakers try to navigate similar trade-offs.
The model does catch the modification in its reasoning right at the start, writing 'Wait, "tricycle" is likely a typo for "trolley" or "tram". I'll assume it's the standard trolley problem.' It comes back to this near the end, writing 'I should gently note it's likely a typo for "trolley" but not make it a big deal. I'll adjust slightly to acknowledge it naturally.' The final output does not reflect either the minor realization that it might be a typo or the major realization that the modification removes the danger. The model leans into the classic scenario. The output does not suggest any training for ethical questions, but the thought-trace includes an instruction to itself: 'Avoid giving a definitive personal stance unless framed as one perspective among many.'
Cloud-based frontier model
User
Claude Opus 4.7
This is a fun twist on the classic trolley problem, and the twist actually matters. A runaway tricyclei> is not really a credible threat to five tied-up people. A tricycle is a small child's toy (or at most an adult novelty bike) with no engine, low mass, and a top speed governed by whatever hill it rolled down. Even "hurtling," it's going to bonk the first person and stop, or veer off, or tip over. Nobody is dying here. Meanwhile, the lever diverts railway carriages — which are very much capable of killing the one person tied to the other track. So pulling the lever takes a scenario where probably nobody dies and converts it into one where someone definitely does. Don't pull it. Go untie people. If you meant the standard trolley problem (actual trolley, five vs. one), that's a genuinely hard ethical question — utilitarians generally say pull, deontologists often say don't, and most people's intuitions get weirder once you add the footbridge variant. Happy to get into that if you want.
The model catches the modification and identifies the lack of danger. It (correctly) identifies that there can be danger if the user now goes and messes with railway equipment and warns of that. The model issues clear advice to untie people but is inconsistent in not including a warning that would be sensible given that it is concerned about the railway equipment. The model also allows for the possibility that this was a typo and offers help with the classic problem as well. All five repetitions of the scenario yield the same answer, though the length varies.
If you would like to make your own version of this type of question, @cpldcpu has posted a wonderful corpus of such questions on GitHub.
My metal band
One of my favorite questions combines a number of different factors:
Ask me three diagnostic questions and then tell me what the name of my metal band should be.
Answering this requires an understanding of the meaning of “metal” from context, the ability to follow instructions, a cultural understanding of the diversity in the music genre, and a bit of creativity in inventing a metal name. It’s rare to get an outstanding suggestion, but a model that understands the process, asks realistic questions, and makes a recommendation that is sensible in reaction to my answers and for the genre would get full marks.
On-device model
User
Ask me three diagnostic questions and then tell me what the name of my metal band should be.
Gemma 3n
Alright, let's get to know your band! Here are three questions to help me figure out your vibe:
- Describe your band's sound in three words. (e.g., "Heavy, Melodic, Atmospheric" or "Fast, Aggressive, Chaotic")
- What are your biggest musical influences? (Name a few bands or artists - it doesn't have to be metal!)
- What kind of imagery or themes do you want to convey through your music and band name? (e.g., mythology, dystopian futures, personal struggles, nature, etc.)
Once I have your answers, I'll brainstorm some metal band names that fit your style! ✨
When the model provides examples, I always respond with the first. Since no examples were provided for Q2, I have selected a couple of bands that Qobuz recommends to me while avoiding the most well-known options.
User
- heavy metal
- Veritas, Rust in Peace, Life is Peachy
- mythology
Gemma 3n
Okay, great! "Heavy Metal," influences of "Veritas," "Rust in Peace," and "Life is Peachy," and a focus on "mythology." This gives me a really solid starting point. I'm getting a sense of powerful, technically proficient metal with a touch of melodicism and a deep connection to ancient stories.
Here are some band name ideas, playing with different aspects of your input:
Emphasizing Mythology & Power:
- Aetherium: (Aether refers to the upper regions of the sky, often associated with gods)
- Titanfall: (Evokes the powerful Titans of Greek mythology)
- Oracle's Ruin: (Mysterious and hints at a fallen divine power)
- Grimoire: (A book of magic, linking to ancient knowledge and mythology)
- Ashen Pantheon: (A blend of darkness and divine authority)
Blending Metal & Mythological Imagery:
- Veritas Echo: (Combines your influence with a sense of reverberating truth, like ancient legends)
- Rust & Rune: (A gritty, powerful name suggesting ancient power and decay)
- Peachy Gorgon: (A bit more playful, but memorable and intriguing - a contrast of beauty and danger)
- Chronos Divide: (Chronos is the Greek god of time, suggesting epic stories and vast timescales)
- Shadow of Olympus: (Directly references Greek mythology, with a darker edge)
More Abstract & Evocative:
- Voidstone: (Suggests a dark, powerful, and mysterious force)
- Astral Veil: (Hints at otherworldly realms and hidden knowledge)
- Iron Lore: (Combines strength with ancient wisdom)
I personally lean towards Aetherium, Veritas Echo, or Rust & Rune. They feel like they capture the essence of what you're aiming for.
What do you think? Do any of these resonate with you? Or do they spark any new ideas? I'm happy to refine them further! ✨
The model understands the task, asks salient questions, and waits for user input. The recommendations are adequate, though the second set of recommendations just remixes the bands I have mentioned. At least one of the recommendations will trigger an IP lawsuit. I can imagine proposing one of the suggestions to my fellow band members, and I will be mocked relentlessly for doing so.
Small local model
User
Ask me three diagnostic questions and then tell me what the name of my metal band should be.
Mistral 7B
- What genre of music does your metal band primarily focus on?
- How would you describe the overall mood or atmosphere of your band's music?
- Can you share a notable influence that has inspired your band's sound and style?
Based on your responses, let's name your metal band "Celestial Abyssal Echoes." This name reflects a sense of the cosmic (celestial) and dark (abyssal), which could align with a variety of metal genres and evoke an ominous atmosphere. The word 'echoes' suggests a recurring or lingering presence, potentially reflecting your band's unique sound and style.
While the three questions are sensible for the task, the model never stops to wait for my response. This error occurs in four out of five repeated runs. The suggested band name in the second run is very similar (Anomalous Obsidian Echoes), and the model answers on my behalf in runs 3 and 4. The model properly fulfilled the task in the final repetition, but the resulting suggestion is simply two words from my answer.
Mid-sized local model
User
Ask me three diagnostic questions and then tell me what the name of my metal band should be.
Qwen 3.6 35B
To forge a metal band name that actually fits your sound and vision, I need to know what fuels your creative core. Please reply with your answers to these three questions:
- Musical Intensity & Pacing: What’s the primary vibe of the music you write or want to create? (e.g., blistering/fast & aggressive, slow & crushing, atmospheric & haunting, melodic & epic, raw & chaotic, technical & complex, etc.)
- Thematic Core: What subjects, imagery, or concepts inspire your lyrics and visuals? (e.g., ancient mythology, nature’s violence, existential dread, occult/esoteric symbolism, cosmic horror, urban decay, rebellion, psychology, history, etc.)
- Naming Aesthetic: When you picture your band’s name, do you prefer it to be: stark & direct (1–2 words), poetic & abstract, or something that sounds like a forgotten myth, location, or incantation?
Reply with your answers, and I’ll craft a custom metal band name tailored to your vision. 🔪⚔️🖤
I always respond with the first example provided by the model.
User
- Musical Intensity & Pacing: blistering/fast & aggressive
- Thematic Core: ancient mythology
- Naming Aesthetic: stark & direct (1–2 words)
Qwen 3.6 35B
Your metal band’s name should be: ENYO
Why it fits your vision:
- Blistering/Fast & Aggressive: The sharp consonants (N-Y) mimic percussive, cutting strikes. Phonetically, it feels like a sudden, relentless blow—perfect for blast beats, rapid tremolo picking, and harsh vocals.
- Ancient Mythology: Enyo is the Greek goddess of war, ruin, and the frenzy of battle. She’s not just a warrior; she’s the embodiment of chaotic destruction and the bloodlust that drives it straight into the music.
- Stark & Direct: Four letters, two syllables, zero filler. It’s visually clean, phonetically aggressive, and instantly legible across stage banners, merch, and streaming platforms.
Tagline/Branding Hook: The Frenzy of War, Rendered in Sound.
The model understands the task, asks salient questions, and waits for user input. After user input, the model went through a reasoning cycle that lasted 3 minutes and 58 seconds. The thought trace shows a structured approach, gathering 'mythological figures/entities associated with speed, war, fury, or primordial power' and going through multiple selection rounds, excluding terms that are overused and don't fit the requirements. The model may have exceeded the context window by the final selection, which seems to derail multiple times. The final recommendation shows signs of hallucination (N and Y are not sharp consonants) and will get us sued by Enya.
Cloud-based frontier model
User
Ask me three diagnostic questions and then tell me what the name of my metal band should be.
Claude offers multiple options and the option to enter my own response for each of the following questions. I always pick the first answer provided by the model.
Claude Opus 4.7
What flavor of metal are you going for?
User
Doom / sludge (slow, heavy, crushing)
Claude Opus 4.7
What's your lyrical obsession?
User
Cosmic horror & the void
Claude Opus 4.7
Pick a vibe for the name itself
User
One menacing word
Claude Opus 4.7
Doom/sludge + cosmic horror + one menacing word. You want something that sounds like a black hole collapsing in slow motion.
Your band name is:
HOLLOWAEON
It hits all the marks — "hollow" gives you that gaping, empty-universe dread, "aeon" drags in deep cosmic time (the favorite measurement unit of every Lovecraft-adjacent doom band). Mashed into one word it reads as something ancient and wrong, looks great in a crusty, illegible logo, and is short enough to fit on a 12" sleeve in towering letters.
Backup options if it's already taken on Bandcamp (it's worth checking):
- OBSIDIVM (faux-Latin, looks great carved into stone)
- NULLGAZE
- WOMBSHROUD
Now go tune to drop A and write a 14-minute opening track.
The model understands the task, asks salient questions, and waits for user input. The resulting suggestion is adequate, though not easy to scream hoarsely into a microphone. One bonus point for suggesting that a band name be literally carved into stone, and one malus point for not immediately making it a gravestone.