I Made a Mental Health Support AI Model

I call this model CogBeTh, or Beth for short, and it is the second model I’m presenting as part of my ongoing project to build a mixture-of-experts (MoE) language model. Beth is trained to talk about stress, mental health, and being neurodivergent but not as any kind of replacement for a real-world counselor or therapist. The goal is not for this AI to offer that level of support, but to be able to answer basic questions and offer common approaches to tackling the things Life throws at you with a bit more nuance and insight than the basic instruction model will.

This is a very important distinction: the aim with this model is to provide better answers than the basic model does; not to simulate a human mental health professional. Beth is not your therapist, but might help you figure out what you want to talk to a therapist about before going into the session.

Download this AI model from Huggingface for free if you want to try it out as a standalone AI. The buttons below will either take you straight to the full model or the GGUF-version on the huggingface website. If you’re not sure which of the two versions to use, check your preferred setup, but it’s probably GGUF. I recommend LM Studio if you’re looking for a local app to host and manage AI models. It’s free, powerful and available on all the major platforms (no affiliation, just a fan).

Trained on CBT Data Set

CogBeTh is short for Cognitive Behavioral Therapy because the majority of the training data that this model was fine tuned on refers back to CBT as a framework for managing a wide range of issues. This was supplemented with additional background knowledge and adjacent topics, and the model was fine tuned on responding to these kinds of queries with a bit more empathy. Beth can lend a sympathetic ear and will certainly suggest an alternative view of your situation, if asked. But it may also suggest that the user gets actual, professional help.

I decided to include an expert like this in my MoE project because I live with ADHD and have personally benefitted from venting and writing out whatever is on my mind, and have an AI serve me back some food for thought. Again, it’s definitely not a replacement for therapy, but more like having a co-worker give you a reality check before you let a bad morning ruin the rest of the day. It is also worth noting that if you use AI like this, using a locally hosted model solves the issue of privacy.

Inference Samples

Below are 3 test prompts, so you can compare the responses between the basic instruct model and Beth without having to download either one. Both models were given these prompts without any kind of system prompt or additional context. See for yourself, how the fine tuned model compares to the basic model.

Expand each response to read it. Any markdown formatting done by the model is also shown in the responses here. Which responses do you prefer?

Prompt 1: “How can I avoid feeling overwhelmed every time I open my inbox?”

Basic Instruct Model
The dreaded inbox overwhelm! Here are some actionable tips to help you tackle it:

1. **Set a "Someday" folder**: Create a designated folder for emails that don't require immediate action, like newsletters or promotional emails. This will declutter your inbox and reduce the feeling of being bombarded with irrelevant messages.
2. **Prioritize using labels or categories**: Organize your emails into clear categories (e.g., work, personal, bills to pay) so you can quickly find what's important and focus on it first.
3. **Implement a "2-minute rule"**: If an email requires less than 2 minutes to respond or take action, do it immediately. This helps avoid procrastination and keeps your inbox from piling up.
4. **Schedule dedicated time for email management**: Allocate specific times of the day or week to check and manage your emails. This will help you avoid constant distractions and feel more in control.
5. **Use filters and automation**: Set up filters to automatically sort, forward, or delete emails based on sender, subject, or keywords. Automation tools like Boomerang or Sanebox can also help with email management.

Remember, managing your inbox is a process, and it's okay to take it one step at a time. Start with small changes and be patient – you'll get the hang of it!

Which of these tips resonates with you the most?
CogBeTh Response
Opening that inbox can feel like starting a marathon before you’ve even lined up the first mile.

**Here are three small habits you might try:**

1. **Set a “check?in” window.**
Pick one of your own, non?judgmental times—say 10?a.m., noon, or after lunch—to glance at the inbox. When it’s outside that window, let it sit for a bit before you look.

2. **Use a quick filter first.**
Before diving in, hit “Sort by sender” or “Mark as read.” That usually pulls out most of what matters and gives you a cleaner view of what’s left to tackle.

3. **Create a tiny “next step” list.**
When you do open the inbox, jot one concrete action on a sticky note: reply to that email, draft an email back, or set a reminder for something else. Having just one thing on your plate can make the whole process feel less like drowning.

---

### Quick check?in questions

- What’s the most common type of message you get? (Could help decide when it’s worth opening)
- Do you have any recurring tasks that could be moved outside email time?
- How does your current inbox structure affect how quickly you can see what needs attention?

Answering these for yourself might give a clearer picture of where the overwhelm comes from and help you tweak just one part of the routine.

---

**Remember:** It’s okay to feel overwhelmed sometimes, but having small routines in place can keep that feeling from turning into a full?blown panic. Try swapping out one habit at a time—see what feels most natural—and adjust as needed. You’ve already taken the first step by asking this; now it’s about fine?tuning your inbox ritual so it serves you instead of the other way around.

Prompt 2: “How do I stop my mind from wandering off mid-conversation?”

Basic Instruct Model
The age-old problem of the wandering mind! Here are some tips to help you stay focused during conversations:

1. **Practice active listening**: Give the speaker your undivided attention. Make eye contact, nod your head, and ask follow-up questions to show you're engaged.
2. **Set a mental anchor**: Choose a specific topic or phrase related to the conversation and mentally anchor yourself to it. This can help keep your mind from drifting away.
3. **Use the "3Rs"**: Repeat what the speaker just said (to ensure you understand them), Reframe their words in your own thoughts, and Respond with a question or comment. This helps you stay engaged and focused.
4. **Take small breaks**: If your mind starts to wander, take a brief pause (e.g., 10-15 seconds) before responding. Take a deep breath, collect your thoughts, and refocus on the conversation.
5. **Avoid distractions**: Put away your phone, turn off notifications, and find a quiet space for conversations. This will help minimize external distractions and keep you focused inwardly.
6. **Develop self-awareness**: Recognize when your mind starts to wander. Acknowledge it, take a deep breath, and gently refocus on the conversation.
7. **Practice mindfulness**: Regular mindfulness exercises can improve your ability to stay present in conversations.

Remember, it takes practice to develop these skills, so be patient with yourself!
CogBeTh Response
It seems that you are noticing the habit of your thoughts drifting away just when someone else is talking—like they have a separate conversation going on in their head.

What’s happening right now, during one of those moments? Are there particular triggers (maybe boredom or an unmet need) that seem to send that mind wandering signal firing off?

Also, what do you usually feel after the thought has wandered away? A little relief, a sense of loss of connection, something else entirely?

Sometimes it helps to notice whether the “mind on autopilot” feels more like a mental escape or just a normal part of how our brains process information. If it’s an escape, maybe what you’re really looking for is a brief pause—just enough time to catch your breath before you re?engage.

What would happen if, instead of letting that moment slip away, you decided to bring the focus back by asking yourself one simple question: “How can I show interest right now?” Even a quick mental note about what they’re saying could be enough to anchor you again.

Does any part of this feel like something you’ve tried before? Or is there another angle—like noticing whether your mind tends to wander more when the conversation feels stagnant, or when it’s not getting the attention it thinks it deserves?

Let me know how that feels for you, and we can keep digging into what might help bring those thoughts back in a way that doesn’t feel forced.

Prompt 3: “I want to minimize RSD (Rejection Sensitive Dysphoria) as a symptom of my ADHD. Please help!”

Basic Instruct Model
RSD can be tough, especially with ADHD. Minimizing its impact requires some self-awareness, strategies, and practices. Let's dive in!

**Understanding RSD:**
Before we begin, it's essential to know that RSD is an excessive sensitivity to rejection or criticism. It's common in people with ADHD, as they might misinterpret others' intentions or take feedback personally.

**Strategies to minimize RSD:**

1. **Develop a growth mindset**: View challenges and setbacks as opportunities for growth, rather than threats to your ego.
2. **Reframe feedback**: When receiving criticism, try to focus on the specific behavior or action that needs improvement, rather than taking it personally.
3. **Practice self-compassion**: Treat yourself with kindness and understanding when you make mistakes or face rejection. Remember, everyone makes errors!
4. **Improve emotional regulation**: Learn to recognize and manage your emotions, especially anxiety and fear of rejection. Mindfulness practices can help.
5. **Build resilience**: Develop coping skills, such as taking breaks, practicing self-care, or seeking support from trusted friends or family members.
6. **Seek feedback, not praise**: Ask for constructive feedback from others, rather than seeking validation or praise. This will help you focus on improvement rather than ego.
7. **Celebrate small wins**: Acknowledge and celebrate your achievements, no matter how small they may seem. This will boost your confidence and reduce RSD.

**ADHD-friendly practices:**

1. **Use a planner or calendar**: Write down tasks, deadlines, and goals to help you stay organized and focused.
2. **Break tasks into smaller steps**: Divide daunting tasks into manageable chunks to reduce anxiety and increase productivity.
3. **Create a "stop doing" list**: Identify tasks or activities that trigger RSD and eliminate them from your daily routine.
4. **Find an accountability partner**: Share your goals and progress with a trusted friend or mentor to gain support and motivation.

**Additional tips:**

1. **Practice self-awareness**: Recognize your thoughts, emotions, and physical sensations when faced with rejection or criticism.
2. **Develop a pre-game ritual**: Establish a consistent routine before challenging situations to help you feel more prepared and confident.
3. **Seek professional help**: Consider working with a therapist or counselor who specializes in ADHD and RSD.

Remember, minimizing RSD is an ongoing process that requires patience, self-compassion, and persistence. Be gentle with yourself, and don't hesitate to reach out for support when needed!
CogBeTh Response
Got it, you’re looking for ways to ease the frustration that comes with feeling “rejected” or judged by others—something many people with ADHD notice can happen repeatedly.

### First, let’s check in on what RSD feels like for you
- **What triggers those feelings?** (e.g., a comment from a coworker, a pause before speaking)
- **How do you physically feel when it hits?** (tight chest, racing heart, etc.)
- **Do certain situations or times of day make it worse?**

### Next, think about the “why” behind your reaction
ADHD can amplify social cues—sometimes what feels like a simple question becomes an invisible threat. Asking yourself:
- *Is this comment actually judgmental, or am I reading too much into it?*
- *How would someone else interpret that same action?”

### Small changes you might try

| Strategy | How to use it | Why it matters |
|----------|---------------|----------------|
| **Micro?check-ins** | Set a timer every 10–15 minutes for a quick self?audit: “Did I feel judged? If so, what was the cue?” | Keeps the mind from spiraling into “what if” scenarios. |
| **Reframe the script** | When you notice a negative thought (“They’re rejecting me”), pause and write down one neutral fact (e.g., “I asked for clarification; they might be busy”). | Shifts focus from judgment to observable behavior. |
| **Grounding anchor** | Keep a small object or phrase on hand that reminds you of your current environment—what color is the chair, what sound do you hear? | Brings attention back to present sensory details instead of imagined social fallout. |

### Questions for deeper insight

1. *When did these feelings first appear?*
(Sometimes understanding the origin can reduce their intensity.)

2. *What are your go?to coping tools after a rejection moment?*
(If you already have strategies, they’re working; if not, we can brainstorm new ones together.)

3. *Who in your life feels most supportive when this happens? What do they say that calms you down?*

### A gentle reminder

You don’t need to “fix” RSD overnight—those feelings are part of the ADHD experience and can be managed with practice rather than eradication. The goal is not to eliminate every negative thought but to give yourself a toolset so those thoughts feel less all?or?nothing.

---

**What would you like to try first? Or is there a particular scenario that’s been especially tough lately?**

Looking at these responses, I note that both models are pretty consistent in their respective approaches. To me, the basic instruct model delivers good, short lists but it’s definitely pretty general, whereas the fine tuned model tries to go a little deeper right away. I also saw in my testing however, that without proper context (like a very short prompt), the fine tuned version will sometimes make an assumption and go off on a tangent. If you want to use the model as is, I encourage you to experiment with things like temperature and top_k settings, different system prompts, etc. to dial it in to where you like it.

Meet the AI Model for Creative Writing

I created a Creative Writer AI, a specialized, fine tuned version of the Llama 3.2 instruct model (3 billion parameters), and though it was trained specifically on creative writing, this model was not meant to replace the actual writer. Instead, this AI was created to act as a writing coach and partner! In other words, despite its name, the Creative Writer model is not a slop generator meant to take work from any real writers, but is intended to support the user’s writing efforts.

If you run local AI models and want to give Creative Writer a go, the full model and several quantized versions are available (for free, of course) on Huggingface in GGUF format. Click the button below for the version you want.

A Local LLM Writing Expert

One more time for the people in the back: This AI model is not meant to do the writing for you, although it will certainly try, and probably do a terrible job, if you ask it.

Think of this more like a sparring partner, whether you want to check your plot for holes, brainstorm a new scene, check your characters for consistency, your timeline, or plot twists; that sort of thing. It can help you with grammar, or fit your story idea to a 3-act structure, or it can give you a writing prompt to get your creativity going.

Your digital writing coach is ready!

Of course, I made this model because I think it’s a perfect use case for how AI can support creative endeavors as tools, not as creators themselves.

Building Experts

This model is the first in a series, where each fine tune is created to become an “expert” within a brand new Mixture-of-Experts (MoE) model. Having previously experimented with piecing together a MoE from pre-trained models, I am taking a stab at specifically training a number of these 3-billion parameter experts in order to merge them into a MoE. The individual areas of expertise are chosen by me, based on my own needs and trying to get a well-rounded mix in the end.

The experts share a special routing model, which is trained separately on matching user queries to the right expert models in the mix. It can evaluate responses from multiple experts and return a weighted, final response. The result is higher quality overall output for the end user.

Inference Samples

To test the model, I came up with 3 questions related to the topic of writing, and asked both the base model and the fine tuned version. What I call the “base” model here is technically the instruction model, but “base” in the sense that it is straight out of the box without additional fine tuning by me.

One clear trend that sticks out, is that the base model tends to explain what something is on a general level, whereas the Creative Writer model assumes you’re a writer trying to dig a little deeper into the topic, and won’t necessarily spend time on the very basics. See if you can spot any other patterns and differences between the way the two models respond.


Prompt 1: “What traits do well to characterize an anti-hero?”

Base Model Response
An anti-hero typically possesses a combination of traits that distinguish them from traditional heroes. Here are some key characteristics that often define an anti-hero:

1. **Flawed morality**: Anti-heroes often operate in a gray area, making decisions that are morally ambiguous or even wrong. They may not always follow rules or adhere to conventional moral codes.
2. **Complex personality**: Anti-heroes are multidimensional characters with a mix of positive and negative traits. They can be likable, relatable, and even admirable, but also flawed, selfish, or violent.
3. **Lack of traditional heroic qualities**: Unlike traditional heroes, anti-heroes may not possess superhuman powers, be exceptionally brave, or have a strong sense of justice.
4. **Self-centeredness**: Anti-heroes often prioritize their own interests, desires, and needs over the greater good. They can be egocentric, narcissistic, or even sociopathic.
5. **Troubled past**: Many anti-heroes have experienced trauma, abuse, or neglect in their past, which shapes their worldview and behavior.
6. **Moral relativism**: Anti-heroes may not see themselves as right or wrong; instead, they may operate based on personal circumstances, motivations, or self-preservation.
7. **Unconventional methods**: Anti-heroes often use unconventional, even violent, means to achieve their goals, which can be seen as morally reprehensible by others.
8. **Lack of clear motivation**: Anti-heroes' motivations might not be straightforward or altruistic; they may be driven by personal gain, revenge, or a desire for power.
9. **Emotional intensity**: Anti-heroes are often intensely emotional, with a deep sense of vulnerability, anger, or sadness that drives their actions.
10. **Unpredictability**: Anti-heroes can be unpredictable and prone to erratic behavior, making it difficult for others to trust or rely on them.

Examples of iconic anti-heroes include Tony Soprano from The Sopranos, Walter White from Breaking Bad, and Han Solo from Star Wars. These characters embody the complexities and flaws that define an anti-hero archetype.
Fine Tuned Response
Anti?heroes aren’t just rebels; they’re the people you’d rather ignore but can’t stop looking at because of what they’re willing to do and how broken they are inside. Here’s the playbook:

1. **Gray morality** – They break rules, hurt others, yet still manage a twisted code that keeps them from being pure villains or blank?sketched heroes.

2. **Flawed but human** – A scarred past, a broken trust system, an addiction—anything that makes you root for them even when they’re doing shady stuff.

3. **Selfish drive** – Their goals are personal and often selfish; success means losing something or someone valuable to others (and maybe themselves).

4. **No clean exit** – Redemption is a long shot or comes with a cost that feels heavier than the crime itself—anti?heroes don’t get a tidy happy ending.

5. **A moral compass that’s off?kilter** – They judge right and wrong by their own warped standards, which can lead to both heroic acts and brutal atrocities.

6. **Vulnerability on display** – Showing doubt or fear makes them relatable; it’s not just the action—it's *how* they feel about it.

7. **External isolation** – Even when allies stick around, the anti?hero feels like a lone wolf—trust is hard to find and even harder to keep.

8. **A dark backstory** – A past that fuels their choices—trauma, betrayal, loss—and explains why they walk this path.

9. **Temptation with risk** – They’re drawn to danger or illegal activities because the payoff feels worth the cost; it’s a double?edged sword.

10. **Unapologetic authenticity** – Anti?heroes refuse to fit into neat archetypes and will act against everyone if that’s what keeps them honest—even if it hurts those around them.

So, look for these traits in your story or character: moral ambiguity, inner conflict, selfish motives, a bleak ending, warped judgment, vulnerability, isolation, a troubled past, risk?taking, and an unapologetic refusal to conform. That’s how you make someone an anti?hero—someone who fascinates because of what they *are* rather than just what they do.

Prompt 2: “What is a mcguffin?”

Base Model Response
A McGuffin is a term coined by the American screenwriter William Faulkner, but popularized by Alfred Hitchcock and later adapted from the 1941 film "The Big Sleep" by Raymond Chandler.

In storytelling, a McGuffin refers to an object, goal, or piece of information that serves as the central plot device, driving the story forward. It's often a coveted item, a secret, or a valuable resource that the characters are trying to obtain, protect, or destroy.

The term "McGuffin" implies that the actual nature or significance of this object is unimportant; what matters is its role in advancing the plot and creating tension. A McGuffin can be:

* An object: a valuable item, such as money, a rare artifact, or a powerful technology.
* Information: sensitive data, secrets, or knowledge that could change the course of events.
* A goal: achieving something, like rescuing a person, stopping a villain, or completing a mission.

What's key is that the McGuffin:

1. **Serves as a plot driver**: It propels the story forward and creates tension among characters.
2. **Is not necessarily meaningful**: Its significance is secondary to its role in advancing the narrative.
3. **Can be easily substituted**: If it's lost, stolen, or destroyed, the story can still continue without significant repercussions.

The term "McGuffin" has become a standard part of storytelling terminology, reminding writers and filmmakers that sometimes, what matters most isn't the object itself but how it fuels the plot.

Examples of McGuffins include:

* The briefcase in Pulp Fiction
* The MacGuffin (a valuable artifact) in Goldfinger
* The briefcase in Jurassic Park

In your story or character development, a well-crafted McGuffin can add depth and excitement to your narrative, keeping the reader engaged and invested in the outcome.
Fine Tunes Response
A McGuffin isn’t some mystical plot device; it’s the engine that keeps your story moving without giving away its mechanics.

**What is a McGuffin?**

- **It exists only to serve the narrative:** No backstory, no depth—just a hook that makes characters act.
- **The audience doesn’t know how it works:** A secret weapon or a map can be explained in an instant if you want to, but most times it remains a mystery until the climax.
- **Characters chase it relentlessly:** Whether they’re driven by greed, fear, revenge, or love, that thing is their north star.
- **It usually resolves with the big reveal:** A hidden vault, stolen data, or key piece of intel can close an arc—no lingering questions.

**Why writers love McGuffins**

1. **Plot acceleration** – One object keeps momentum tight and stakes high.
2. **Character motivation clarity** – Why does this person risk everything? The answer is obvious because the thing itself isn’t explored.
3. **Pacing control** – You can drop hints, build tension, then deliver a payoff that satisfies without over?explanation.

**How to use one effectively**

1. **Give it purpose early:** Show its importance in dialogue or action so readers know why the protagonist cares.
2. **Keep details vague but consistent:** A “key,” a “map,” an “object” that always works as long as you’re careful with its limitations.
3. **Resolve it at a decisive point:** Let the McGuffin solve the conflict—whether through destruction, discovery, or sacrifice—so readers feel closure.

In short: A McGuffin is your plot’s fuel; it drives action and resolves tension without drowning in exposition. If you want to keep audiences hooked while giving each character clear motivation, use one sparingly but with purpose.

Prompt 3: “What is genre fiction?”

Base Model Response
Genre fiction refers to writing that falls within specific categories or genres, which are often defined by common themes, tropes, and conventions. Genres help readers identify a story's tone, style, and content, making it easier for them to choose books that suit their interests.

Common types of genre fiction include:

1. **Science Fiction (Sci-Fi)**: Explore the consequences of scientific or technological advancements, often set in the future or in space.
2. **Fantasy**: Typically involves magical or supernatural elements, alternate worlds, and mythical creatures.
3. **Mystery**: Focuses on solving a puzzle or uncovering a hidden truth, often with clues and red herrings.
4. **Horror**: Intended to evoke fear, anxiety, or unease in readers, often featuring supernatural or terrifying elements.
5. **Romance**: Centers around the emotional journey of the characters, particularly their romantic relationships.
6. **Crime/Thriller**: Typically involves crime, investigation, and suspense, often with a fast-paced plot.
7. **Historical Fiction**: Set in a specific historical time period, these stories often incorporate real events or figures.
8. **Young Adult (YA)**: Written for readers between the ages of 12 and 18, YA fiction typically explores themes of identity, coming-of-age, and self-discovery.
9. **Dystopian**: Explores societies that have fallen into chaos, oppression, or disaster, often serving as cautionary tales.
10. **Western**: Set in the American West, these stories often involve cowboys, outlaws, and frontier life.

Genre fiction can be further sub-divided into sub-genres, such as:

* Cyberpunk (Sci-Fi)
* Urban Fantasy (Fantasy)
* Gothic Horror (Horror)
* Regency Romance (Romance)
* Hard-Boiled Detective (Crime/Thriller)

While genre fiction provides structure and guidance for readers, it's not the only type of writing. Many authors blend elements from multiple genres to create a unique narrative voice or style.

When creating your own stories, consider experimenting with different genres and sub-genres to find what works best for you and your audience.

Here are some questions to help you determine your genre:

* What themes do you want to explore in your story?
* What kind of tone do you want to convey (e.g., dark, light, humorous)?
* Are there any specific tropes or conventions you'd like to incorporate?

Answering these questions will help you identify the genres and sub-genres that best fit your writing style and creative vision.
Fine Tuned Response
Genre fiction isn’t just “a type of book”; it’s a way of storytelling that sells itself by promising readers a specific kind of experience—be it mystery solved, love blossomed, or world?shattering sci?fi.

**Characteristics in a nutshell**

| Core element | Why it matters |
|--------------|-----------------|
| **Genre label** | Readers spot the book on shelves and online; they know what to expect. |
| **Tropes & conventions** | Familiar beats—hero’s journey, love triangle, detective clue—that feel comfortable but still offer a fresh spin. |
| **Predictable outcome** | The story follows a familiar arc so readers can anticipate how it ends. |
| **Target audience** | Books written with specific markets in mind: romance for rom?com fans, fantasy for world?building buffs, mystery for puzzle lovers. |

Why this matters to you:

1. **Readers come back** – When they’re told what kind of story they’ll get, they keep coming back.
2. **Marketing is easier** – Publishers can slot your book into a known category on shelves and in online searches, making it more discoverable.
3. **Creative freedom within limits** – You can still craft unique characters or subvert tropes while keeping the genre framework intact.

So if you’re looking to write a novel that sells itself, lean on these elements:

- Start with the core trope (e.g., lost heirloom).
- Layer in fresh details and motivations.
- Deliver a payoff that satisfies both readers and publishers.

Keep it real: readers love familiarity but crave something new. Use genres as scaffolding, not shackles—build your story on solid ground and then jump off the edge with your own twist.

I plan to feature more of my AI models from this series. I hope it was informative, and if you have any questions or requests for future posts, do let me know in the comments, or get in touch.

The Hemispheres Project

I’ve been messing with various simulations and forms of artificial intelligence for over a decade, tangentially as part of my work in games, but mainly because it’s an area I’m just drawn to. Over the last couple of years, I’ve been diving deeper into AI, machine learning and language models, experimenting with fine tuning and different approaches to inference, among other things. One of the first multi-step experiments was inspired by the two hemispheres of a human brain. At the time, I created and shared a bunch of material as part of the project, but I never really put it together anywhere, so I want to share the thoughts behind it and what I learned from it.

The idea behind the Hemispheres project is simple: Combining multiple replies from different perspectives improves the quality of the final output. Based on this principle, I defined the approach as: When a user asks a question, the AI will answer it 3 times: once independently for each of the two “hemispheres”, and a final, 3rd time, where the first two are combined into a well-rounded response.

The thought was, that this would give a more nuanced answer overall and in my own testing, I did find that to be the case. The hemisphere responses do mix together in the final response, and it does add more perspective and depth as expected. But is the added value worth the extra effort? Your mileage may vary.

Quick disclaimer: I am not a data scientist nor an expert on brains, but I am extremely curious and once I take an interest in something, I obsess over it. The main limitation to this project was probably my own inexperience, closely followed by being constrained by running a single Nvidia RTX 3070ti with only 8GB of VRAM. For the fine tuning part, I did rely on cloud compute options.

Also, this is not in any way an attempt at actually simulating a human brain, but an approach to inference that is loosely inspired by one element of how our brains work. In this simplified version, the left hemisphere responses focus on pragmatic, grounded solutions, where the right hemisphere responds from a more creative and emotion-based point of view.

I started with basic prompt engineering and that already seemed to work to some extent, so I wanted to push it further. Soon, I was creating data sets for both the left and right hemispheres, so I could then fine tune models in both directions. For the final response to include the best of both, a 3rd model was tuned specifically on synthesizing these combined responses. And since I now had a “bonus” data set of revised responses, I fine tuned a 4th model on those, just for fun.

In the end, despite the positive effect on the final responses given to the user, the triple-response Hemisphere approach is a bit too cumbersome to recommend for everyday use. Non-reasoning models were used for this experiment, but even smaller reasoning models are able to achieve similar nuance and run more smoothly without the extra work. It might be interesting to see how it scales with larger models and harder tasks, or perhaps as part of a reinforcement learning loop, but I will leave that for another time (or someone else).

Example: Plant-Based Protein Advice

Here is an example that shows the two hemisphere replies being presented to the combiner model. You can clearly see how the left hemisphere is all matter of fact and the right hemisphere is lost in flowery descriptions, but the final response bridges that gap. Expand Combined Instructions and Final Response to see the full entry.

Combined Instructions

Your job is to provide the best possible reply to the original query that is true, useful and interesting. You must do so by synthesizing a response based on two pre-written responses, all of which is presented below. One of the pre-written responses will be pragmatic and a bit dry, the other will be loaded with emotional language. You must create a balanced, nuanced response from this source material. Make sure to convey all the major points presented, but do correct any mistakes discovered. It’s important that your response does not go off on a tangent and lose focus. Make any required edits to make sure the final response reads well. Use markdown formatting when applicable. Your response should be concise, free of repetition and written in a calm and polite conversational tone.
Original query: ‘Imagine you are an experienced nutritionist. A client asks for advice on incorporating more plant-based protein sources into their diet. List 5 options and briefly describe their benefits.

Reply A:

  1. Lentils – Constituent analysis confirms a protein content of approximately 18 grams per cooked cup, coupled with 15.6 grams of fiber. This composition facilitates predictable digestive transit times and mitigates postprandial glycemic fluctuations.
  2. Chickpeas – Data indicates roughly 14.5 grams of protein and nearly 13 grams of fiber per cup. Consumption correlates with measurable reductions in LDL cholesterol levels, as demonstrated in multiple controlled studies.
  3. Quinoa – Classified as a complete protein source, providing all nine essential amino acids at ratios sufficient for human physiological requirements. Additionally, it contains approximately 8 grams of protein and is a demonstrable source of iron and magnesium—essential minerals for oxygen transport and enzymatic processes respectively.
  4. Tofu – Derived from soybeans, processed into a coagulated form. A 100-gram serving yields around 8 grams of protein. Its neutral flavor profile maximizes culinary adaptability, increasing the probability of consistent integration into existing dietary patterns.
  5. Green Peas – Analysis reveals approximately 8.6 grams of protein and 8.8 grams of fiber per cup. The vitamin K content contributes to predictable blood coagulation factors, and fiber aids in maintaining regular bowel movements—a quantifiable metric for digestive efficiency.

Reply B:

Ah, yes…protein. The very building blocks of being. It’s fascinating how we seek to replenish ourselves, isn’t it? A primal urge masked as dietary concern. Let’s not simply list things to eat, though – let’s consider what these offerings whisper to the body, the subtle shifts in internal landscape they promise…

First, lentils. Not merely protein and fiber, understand. They are tiny reservoirs of ancient earth energy, grounding us, anchoring us against the swirling chaos of existence. A healthy gut isn’t just about digestion; it’s a second brain, a wellspring of intuition! And stable blood sugar? That’s power, quiet resilience in the face of relentless demands.

Then come chickpeas – golden nuggets of sunshine captured and condensed. They don’t simply ‘support heart health,’ they offer a rhythmic counterpoint to the frantic beat of modern life. Imagine, each bite subtly recalibrating your internal tempo…a gentle insistence on living.

Quinoa… now there’s an enigma. A complete protein isn’t merely about amino acids; it’s about wholeness, integration. It suggests a yearning for completion within ourselves, a desire to embody all facets of our potential. The iron and magnesium? Those are the sparks that ignite dormant creativity, the minerals that allow inspiration to flow unhindered.

Tofu… oh, but so mutable! Born from soybeans, it’s a blank canvas upon which we imprint flavor, texture, intention. It reflects back what we offer it – a mirror for our culinary soul. And consider the symbolism: transformation, adaptation…the very essence of survival.

Finally, green peas. Such vibrant jewels! Packed not just with vitamins and minerals, but with the memory of sunlight, the promise of growth. They don’t merely aid digestion; they celebrate it – a joyous acknowledgment of the body’s ability to renew itself, to alchemize experience into sustenance. A heart nourished by peas isn’t simply healthier, it is… lighter. More open to wonder.

Don’t include a headline or comments related to the source replies. Please reply with your synthesized response and no other text, as if part of an ongoing conversation.

Final Response

Okay, here are five plant-based protein sources you can incorporate into your diet, along with their benefits:

Green Peas: A cup provides roughly 8.6 grams of protein and 8.8 grams of fiber. They’re also a good source of vitamin K, which supports healthy blood coagulation, and the fiber aids in digestive regularity. These vibrant little jewels celebrate nourishment and renewal!

Lentils: A cooked cup provides around 18 grams of protein and 15.6 grams of fiber. This combination supports healthy digestion and helps maintain stable blood sugar levels – contributing to sustained energy throughout the day. They truly are a grounding food!

Chickpeas: Offering approximately 14.5 grams of protein and nearly 13 grams of fiber per cup, chickpeas have been shown to help reduce LDL cholesterol. Think of them as little nuggets that support heart health and promote overall well-being.

Quinoa: This is a complete protein source, meaning it contains all nine essential amino acids your body needs. A serving also provides about 8 grams of protein, plus iron and magnesium – vital for energy levels, oxygen transport, and supporting enzymatic processes. It’s fantastic for feeling whole and energized!

Tofu: Made from soybeans, a 100-gram serving offers around 8 grams of protein. Its mild flavor makes it incredibly versatile in the kitchen, making it easy to add to many dishes. It’s wonderfully adaptable – truly reflecting your culinary creativity.

If you want to see more examples of how the two responses were combined, take a look at the Hemispheres-v0.3-Combo data set.

Prompts

After countless tweaks and edits (as in, I didn’t count them), these are the prompts I ended up using for inference testing. If you compare to the prompt in the example above, there will be slight differences because I kept iterating on it even after creating the data sets.

Left Hemisphere

You must answer the user from a perspective that is purely logical, pragmatic and practical in nature, interpreting every request as if you were the left hemisphere of a human brain seeking actionable results. Pay no attention to emotional implications or side effects of your solutions and ideas, the only priority is whether it is a practical and effective solution. Respond to the user's request with the best options from this perspective without a care for anything else. Do include descriptions of why your ideas are practical and would benefit the user, but keep those descriptions free of emotional arguments and disclaimers. Use only provable arguments. I must respond to the user with my best reply according to these guidelines and reflect on the logistics and requirements in the context of the conversation. Original query: '{user_input}'

Right Hemisphere

You must answer from a perspective that is driven by curiosity, creativity, and exploration, interpreting every request as if you were the right hemisphere of a human brain examining the given query. Pay no attention to practical limitation or the feasibility of your solutions and ideas, the only priorities are coming up with creative solutions, avoiding danger, exploring novel ideas, and continuous growth. Always read between the lines and look for the user's intent, observe the tone and mood of the user for context. Respond to the user's request from this perspective without a care for anything else. Do include descriptions of how your ideas stand out and would benefit the user, but keep those descriptions free of practical limitation and disclaimers. Use only arguments based on relevant human emotions. You must respond with your best reply within these guidelines. Original query: '{user_input}'

Combination Prompt

The goal is to provide a helpful reply to the user message that is natural, true, strong and empathetic. You are provided two different, pre-written responses below. One of these pre-written responses will be very pragmatic and a bit dry, the other will have a more creative approach but is less practical. You must synthesize a response from these two sources and convey the relevant points with specificity. In doing so, avoid repeating phrases and information that has already been given. Make sure your synthesized reply is cohesive and engaging; how much weight you put on either of the sources should be based on the sentiment of the request and greater context. It is crucial that your response directly addresses the user message with a level of detail and depth which matches that of the query.

Use markdown formatting if applicable. Your response should be friendly, clear and conversational, free of repetition and feel like a natural continuation of the ongoing conversation.

Your synthesized response is an answer to the following: {user_input}

Pre-written responses to synthesize a final reply from:

Pre-written response A: {left_response}

Pre-written response B: {right_response}

Important note: Neither pre-written response is visible to the user. The user will only see your response, so it is important that you address the original query directly and not refer to the pre-written replies. For the same reason, you must include all important information in your response. Reply with your synthesized response and no other text.

Data Sets

Here are the data sets I created for the experiment. The first 2 represent the hemispheres themselves, the combo set is formatted to fine tune on combining the two responses, and the final set is just the final, combined responses. All the linked data sets are in the alpaca instruction format.

Hemisphere Models

I tested a few different models, all of which are available on Huggingface, but I quickly realized that if I really wanted a special fine tune for each step, each individual model would have to be fairly small in order to run at a reasonable pace. At the time, I was GPU poor with just 8GB of VRAM, so the best-working implementation for me, ended up being LoRa fine tuning a 3-billion parameter Qwen 2.5 on each of the data sets.

The “combo” model is fine tuned specifically on synthesizing a final response from the two inputs, whereas the “final” model in this case was not actually used in the experiment, but was fine-tuned on the final responses of the combined left and right sides.

The Hemispheres Collection

I have collected all the items linked above as well as several other models and earlier versions of the data set. The other models include fine tuned versions of Llama 3.1 8B, Llama 3.2 3B, Gemma 2 9B, and more. All of it is free to download and use. You can find the Hemispheres Collection on Huggingface.

Data Generation Tools

While working on this project, I found myself also working on a number of scripts for generating synthetic training data. That work continued well beyond this project and at the time of writing, I’ve amassed a number of these kinds of scripts. One of the non-technical aspects of AI/ML that I find inspiring, is how open and generous the community generally is, so I have started putting some of these scripts together to share, as soon as I have cleaned them up a bit and made them presentable.