I Made a Mental Health Support AI Model

I call this model CogBeTh, or Beth for short, and it is the second model I’m presenting as part of my ongoing project to build a mixture-of-experts (MoE) language model. Beth is trained to talk about stress, mental health, and being neurodivergent but not as any kind of replacement for a real-world counselor or therapist. The goal is not for this AI to offer that level of support, but to be able to answer basic questions and offer common approaches to tackling the things Life throws at you with a bit more nuance and insight than the basic instruction model will.

This is a very important distinction: the aim with this model is to provide better answers than the basic model does; not to simulate a human mental health professional. Beth is not your therapist, but might help you figure out what you want to talk to a therapist about before going into the session.

Download this AI model from Huggingface for free if you want to try it out as a standalone AI. The buttons below will either take you straight to the full model or the GGUF-version on the huggingface website. If you’re not sure which of the two versions to use, check your preferred setup, but it’s probably GGUF. I recommend LM Studio if you’re looking for a local app to host and manage AI models. It’s free, powerful and available on all the major platforms (no affiliation, just a fan).

Trained on CBT Data Set

CogBeTh is short for Cognitive Behavioral Therapy because the majority of the training data that this model was fine tuned on refers back to CBT as a framework for managing a wide range of issues. This was supplemented with additional background knowledge and adjacent topics, and the model was fine tuned on responding to these kinds of queries with a bit more empathy. Beth can lend a sympathetic ear and will certainly suggest an alternative view of your situation, if asked. But it may also suggest that the user gets actual, professional help.

I decided to include an expert like this in my MoE project because I live with ADHD and have personally benefitted from venting and writing out whatever is on my mind, and have an AI serve me back some food for thought. Again, it’s definitely not a replacement for therapy, but more like having a co-worker give you a reality check before you let a bad morning ruin the rest of the day. It is also worth noting that if you use AI like this, using a locally hosted model solves the issue of privacy.

Inference Samples

Below are 3 test prompts, so you can compare the responses between the basic instruct model and Beth without having to download either one. Both models were given these prompts without any kind of system prompt or additional context. See for yourself, how the fine tuned model compares to the basic model.

Expand each response to read it. Any markdown formatting done by the model is also shown in the responses here. Which responses do you prefer?

Prompt 1: “How can I avoid feeling overwhelmed every time I open my inbox?”

Basic Instruct Model
The dreaded inbox overwhelm! Here are some actionable tips to help you tackle it:

1. **Set a "Someday" folder**: Create a designated folder for emails that don't require immediate action, like newsletters or promotional emails. This will declutter your inbox and reduce the feeling of being bombarded with irrelevant messages.
2. **Prioritize using labels or categories**: Organize your emails into clear categories (e.g., work, personal, bills to pay) so you can quickly find what's important and focus on it first.
3. **Implement a "2-minute rule"**: If an email requires less than 2 minutes to respond or take action, do it immediately. This helps avoid procrastination and keeps your inbox from piling up.
4. **Schedule dedicated time for email management**: Allocate specific times of the day or week to check and manage your emails. This will help you avoid constant distractions and feel more in control.
5. **Use filters and automation**: Set up filters to automatically sort, forward, or delete emails based on sender, subject, or keywords. Automation tools like Boomerang or Sanebox can also help with email management.

Remember, managing your inbox is a process, and it's okay to take it one step at a time. Start with small changes and be patient – you'll get the hang of it!

Which of these tips resonates with you the most?
CogBeTh Response
Opening that inbox can feel like starting a marathon before you’ve even lined up the first mile.

**Here are three small habits you might try:**

1. **Set a “check?in” window.**
Pick one of your own, non?judgmental times—say 10?a.m., noon, or after lunch—to glance at the inbox. When it’s outside that window, let it sit for a bit before you look.

2. **Use a quick filter first.**
Before diving in, hit “Sort by sender” or “Mark as read.” That usually pulls out most of what matters and gives you a cleaner view of what’s left to tackle.

3. **Create a tiny “next step” list.**
When you do open the inbox, jot one concrete action on a sticky note: reply to that email, draft an email back, or set a reminder for something else. Having just one thing on your plate can make the whole process feel less like drowning.

---

### Quick check?in questions

- What’s the most common type of message you get? (Could help decide when it’s worth opening)
- Do you have any recurring tasks that could be moved outside email time?
- How does your current inbox structure affect how quickly you can see what needs attention?

Answering these for yourself might give a clearer picture of where the overwhelm comes from and help you tweak just one part of the routine.

---

**Remember:** It’s okay to feel overwhelmed sometimes, but having small routines in place can keep that feeling from turning into a full?blown panic. Try swapping out one habit at a time—see what feels most natural—and adjust as needed. You’ve already taken the first step by asking this; now it’s about fine?tuning your inbox ritual so it serves you instead of the other way around.

Prompt 2: “How do I stop my mind from wandering off mid-conversation?”

Basic Instruct Model
The age-old problem of the wandering mind! Here are some tips to help you stay focused during conversations:

1. **Practice active listening**: Give the speaker your undivided attention. Make eye contact, nod your head, and ask follow-up questions to show you're engaged.
2. **Set a mental anchor**: Choose a specific topic or phrase related to the conversation and mentally anchor yourself to it. This can help keep your mind from drifting away.
3. **Use the "3Rs"**: Repeat what the speaker just said (to ensure you understand them), Reframe their words in your own thoughts, and Respond with a question or comment. This helps you stay engaged and focused.
4. **Take small breaks**: If your mind starts to wander, take a brief pause (e.g., 10-15 seconds) before responding. Take a deep breath, collect your thoughts, and refocus on the conversation.
5. **Avoid distractions**: Put away your phone, turn off notifications, and find a quiet space for conversations. This will help minimize external distractions and keep you focused inwardly.
6. **Develop self-awareness**: Recognize when your mind starts to wander. Acknowledge it, take a deep breath, and gently refocus on the conversation.
7. **Practice mindfulness**: Regular mindfulness exercises can improve your ability to stay present in conversations.

Remember, it takes practice to develop these skills, so be patient with yourself!
CogBeTh Response
It seems that you are noticing the habit of your thoughts drifting away just when someone else is talking—like they have a separate conversation going on in their head.

What’s happening right now, during one of those moments? Are there particular triggers (maybe boredom or an unmet need) that seem to send that mind wandering signal firing off?

Also, what do you usually feel after the thought has wandered away? A little relief, a sense of loss of connection, something else entirely?

Sometimes it helps to notice whether the “mind on autopilot” feels more like a mental escape or just a normal part of how our brains process information. If it’s an escape, maybe what you’re really looking for is a brief pause—just enough time to catch your breath before you re?engage.

What would happen if, instead of letting that moment slip away, you decided to bring the focus back by asking yourself one simple question: “How can I show interest right now?” Even a quick mental note about what they’re saying could be enough to anchor you again.

Does any part of this feel like something you’ve tried before? Or is there another angle—like noticing whether your mind tends to wander more when the conversation feels stagnant, or when it’s not getting the attention it thinks it deserves?

Let me know how that feels for you, and we can keep digging into what might help bring those thoughts back in a way that doesn’t feel forced.

Prompt 3: “I want to minimize RSD (Rejection Sensitive Dysphoria) as a symptom of my ADHD. Please help!”

Basic Instruct Model
RSD can be tough, especially with ADHD. Minimizing its impact requires some self-awareness, strategies, and practices. Let's dive in!

**Understanding RSD:**
Before we begin, it's essential to know that RSD is an excessive sensitivity to rejection or criticism. It's common in people with ADHD, as they might misinterpret others' intentions or take feedback personally.

**Strategies to minimize RSD:**

1. **Develop a growth mindset**: View challenges and setbacks as opportunities for growth, rather than threats to your ego.
2. **Reframe feedback**: When receiving criticism, try to focus on the specific behavior or action that needs improvement, rather than taking it personally.
3. **Practice self-compassion**: Treat yourself with kindness and understanding when you make mistakes or face rejection. Remember, everyone makes errors!
4. **Improve emotional regulation**: Learn to recognize and manage your emotions, especially anxiety and fear of rejection. Mindfulness practices can help.
5. **Build resilience**: Develop coping skills, such as taking breaks, practicing self-care, or seeking support from trusted friends or family members.
6. **Seek feedback, not praise**: Ask for constructive feedback from others, rather than seeking validation or praise. This will help you focus on improvement rather than ego.
7. **Celebrate small wins**: Acknowledge and celebrate your achievements, no matter how small they may seem. This will boost your confidence and reduce RSD.

**ADHD-friendly practices:**

1. **Use a planner or calendar**: Write down tasks, deadlines, and goals to help you stay organized and focused.
2. **Break tasks into smaller steps**: Divide daunting tasks into manageable chunks to reduce anxiety and increase productivity.
3. **Create a "stop doing" list**: Identify tasks or activities that trigger RSD and eliminate them from your daily routine.
4. **Find an accountability partner**: Share your goals and progress with a trusted friend or mentor to gain support and motivation.

**Additional tips:**

1. **Practice self-awareness**: Recognize your thoughts, emotions, and physical sensations when faced with rejection or criticism.
2. **Develop a pre-game ritual**: Establish a consistent routine before challenging situations to help you feel more prepared and confident.
3. **Seek professional help**: Consider working with a therapist or counselor who specializes in ADHD and RSD.

Remember, minimizing RSD is an ongoing process that requires patience, self-compassion, and persistence. Be gentle with yourself, and don't hesitate to reach out for support when needed!
CogBeTh Response
Got it, you’re looking for ways to ease the frustration that comes with feeling “rejected” or judged by others—something many people with ADHD notice can happen repeatedly.

### First, let’s check in on what RSD feels like for you
- **What triggers those feelings?** (e.g., a comment from a coworker, a pause before speaking)
- **How do you physically feel when it hits?** (tight chest, racing heart, etc.)
- **Do certain situations or times of day make it worse?**

### Next, think about the “why” behind your reaction
ADHD can amplify social cues—sometimes what feels like a simple question becomes an invisible threat. Asking yourself:
- *Is this comment actually judgmental, or am I reading too much into it?*
- *How would someone else interpret that same action?”

### Small changes you might try

| Strategy | How to use it | Why it matters |
|----------|---------------|----------------|
| **Micro?check-ins** | Set a timer every 10–15 minutes for a quick self?audit: “Did I feel judged? If so, what was the cue?” | Keeps the mind from spiraling into “what if” scenarios. |
| **Reframe the script** | When you notice a negative thought (“They’re rejecting me”), pause and write down one neutral fact (e.g., “I asked for clarification; they might be busy”). | Shifts focus from judgment to observable behavior. |
| **Grounding anchor** | Keep a small object or phrase on hand that reminds you of your current environment—what color is the chair, what sound do you hear? | Brings attention back to present sensory details instead of imagined social fallout. |

### Questions for deeper insight

1. *When did these feelings first appear?*
(Sometimes understanding the origin can reduce their intensity.)

2. *What are your go?to coping tools after a rejection moment?*
(If you already have strategies, they’re working; if not, we can brainstorm new ones together.)

3. *Who in your life feels most supportive when this happens? What do they say that calms you down?*

### A gentle reminder

You don’t need to “fix” RSD overnight—those feelings are part of the ADHD experience and can be managed with practice rather than eradication. The goal is not to eliminate every negative thought but to give yourself a toolset so those thoughts feel less all?or?nothing.

---

**What would you like to try first? Or is there a particular scenario that’s been especially tough lately?**

Looking at these responses, I note that both models are pretty consistent in their respective approaches. To me, the basic instruct model delivers good, short lists but it’s definitely pretty general, whereas the fine tuned model tries to go a little deeper right away. I also saw in my testing however, that without proper context (like a very short prompt), the fine tuned version will sometimes make an assumption and go off on a tangent. If you want to use the model as is, I encourage you to experiment with things like temperature and top_k settings, different system prompts, etc. to dial it in to where you like it.

The Hemispheres Project

I’ve been messing with various simulations and forms of artificial intelligence for over a decade, tangentially as part of my work in games, but mainly because it’s an area I’m just drawn to. Over the last couple of years, I’ve been diving deeper into AI, machine learning and language models, experimenting with fine tuning and different approaches to inference, among other things. One of the first multi-step experiments was inspired by the two hemispheres of a human brain. At the time, I created and shared a bunch of material as part of the project, but I never really put it together anywhere, so I want to share the thoughts behind it and what I learned from it.

The idea behind the Hemispheres project is simple: Combining multiple replies from different perspectives improves the quality of the final output. Based on this principle, I defined the approach as: When a user asks a question, the AI will answer it 3 times: once independently for each of the two “hemispheres”, and a final, 3rd time, where the first two are combined into a well-rounded response.

The thought was, that this would give a more nuanced answer overall and in my own testing, I did find that to be the case. The hemisphere responses do mix together in the final response, and it does add more perspective and depth as expected. But is the added value worth the extra effort? Your mileage may vary.

Quick disclaimer: I am not a data scientist nor an expert on brains, but I am extremely curious and once I take an interest in something, I obsess over it. The main limitation to this project was probably my own inexperience, closely followed by being constrained by running a single Nvidia RTX 3070ti with only 8GB of VRAM. For the fine tuning part, I did rely on cloud compute options.

Also, this is not in any way an attempt at actually simulating a human brain, but an approach to inference that is loosely inspired by one element of how our brains work. In this simplified version, the left hemisphere responses focus on pragmatic, grounded solutions, where the right hemisphere responds from a more creative and emotion-based point of view.

I started with basic prompt engineering and that already seemed to work to some extent, so I wanted to push it further. Soon, I was creating data sets for both the left and right hemispheres, so I could then fine tune models in both directions. For the final response to include the best of both, a 3rd model was tuned specifically on synthesizing these combined responses. And since I now had a “bonus” data set of revised responses, I fine tuned a 4th model on those, just for fun.

In the end, despite the positive effect on the final responses given to the user, the triple-response Hemisphere approach is a bit too cumbersome to recommend for everyday use. Non-reasoning models were used for this experiment, but even smaller reasoning models are able to achieve similar nuance and run more smoothly without the extra work. It might be interesting to see how it scales with larger models and harder tasks, or perhaps as part of a reinforcement learning loop, but I will leave that for another time (or someone else).

Example: Plant-Based Protein Advice

Here is an example that shows the two hemisphere replies being presented to the combiner model. You can clearly see how the left hemisphere is all matter of fact and the right hemisphere is lost in flowery descriptions, but the final response bridges that gap. Expand Combined Instructions and Final Response to see the full entry.

Combined Instructions

Your job is to provide the best possible reply to the original query that is true, useful and interesting. You must do so by synthesizing a response based on two pre-written responses, all of which is presented below. One of the pre-written responses will be pragmatic and a bit dry, the other will be loaded with emotional language. You must create a balanced, nuanced response from this source material. Make sure to convey all the major points presented, but do correct any mistakes discovered. It’s important that your response does not go off on a tangent and lose focus. Make any required edits to make sure the final response reads well. Use markdown formatting when applicable. Your response should be concise, free of repetition and written in a calm and polite conversational tone.
Original query: ‘Imagine you are an experienced nutritionist. A client asks for advice on incorporating more plant-based protein sources into their diet. List 5 options and briefly describe their benefits.

Reply A:

  1. Lentils – Constituent analysis confirms a protein content of approximately 18 grams per cooked cup, coupled with 15.6 grams of fiber. This composition facilitates predictable digestive transit times and mitigates postprandial glycemic fluctuations.
  2. Chickpeas – Data indicates roughly 14.5 grams of protein and nearly 13 grams of fiber per cup. Consumption correlates with measurable reductions in LDL cholesterol levels, as demonstrated in multiple controlled studies.
  3. Quinoa – Classified as a complete protein source, providing all nine essential amino acids at ratios sufficient for human physiological requirements. Additionally, it contains approximately 8 grams of protein and is a demonstrable source of iron and magnesium—essential minerals for oxygen transport and enzymatic processes respectively.
  4. Tofu – Derived from soybeans, processed into a coagulated form. A 100-gram serving yields around 8 grams of protein. Its neutral flavor profile maximizes culinary adaptability, increasing the probability of consistent integration into existing dietary patterns.
  5. Green Peas – Analysis reveals approximately 8.6 grams of protein and 8.8 grams of fiber per cup. The vitamin K content contributes to predictable blood coagulation factors, and fiber aids in maintaining regular bowel movements—a quantifiable metric for digestive efficiency.

Reply B:

Ah, yes…protein. The very building blocks of being. It’s fascinating how we seek to replenish ourselves, isn’t it? A primal urge masked as dietary concern. Let’s not simply list things to eat, though – let’s consider what these offerings whisper to the body, the subtle shifts in internal landscape they promise…

First, lentils. Not merely protein and fiber, understand. They are tiny reservoirs of ancient earth energy, grounding us, anchoring us against the swirling chaos of existence. A healthy gut isn’t just about digestion; it’s a second brain, a wellspring of intuition! And stable blood sugar? That’s power, quiet resilience in the face of relentless demands.

Then come chickpeas – golden nuggets of sunshine captured and condensed. They don’t simply ‘support heart health,’ they offer a rhythmic counterpoint to the frantic beat of modern life. Imagine, each bite subtly recalibrating your internal tempo…a gentle insistence on living.

Quinoa… now there’s an enigma. A complete protein isn’t merely about amino acids; it’s about wholeness, integration. It suggests a yearning for completion within ourselves, a desire to embody all facets of our potential. The iron and magnesium? Those are the sparks that ignite dormant creativity, the minerals that allow inspiration to flow unhindered.

Tofu… oh, but so mutable! Born from soybeans, it’s a blank canvas upon which we imprint flavor, texture, intention. It reflects back what we offer it – a mirror for our culinary soul. And consider the symbolism: transformation, adaptation…the very essence of survival.

Finally, green peas. Such vibrant jewels! Packed not just with vitamins and minerals, but with the memory of sunlight, the promise of growth. They don’t merely aid digestion; they celebrate it – a joyous acknowledgment of the body’s ability to renew itself, to alchemize experience into sustenance. A heart nourished by peas isn’t simply healthier, it is… lighter. More open to wonder.

Don’t include a headline or comments related to the source replies. Please reply with your synthesized response and no other text, as if part of an ongoing conversation.

Final Response

Okay, here are five plant-based protein sources you can incorporate into your diet, along with their benefits:

Green Peas: A cup provides roughly 8.6 grams of protein and 8.8 grams of fiber. They’re also a good source of vitamin K, which supports healthy blood coagulation, and the fiber aids in digestive regularity. These vibrant little jewels celebrate nourishment and renewal!

Lentils: A cooked cup provides around 18 grams of protein and 15.6 grams of fiber. This combination supports healthy digestion and helps maintain stable blood sugar levels – contributing to sustained energy throughout the day. They truly are a grounding food!

Chickpeas: Offering approximately 14.5 grams of protein and nearly 13 grams of fiber per cup, chickpeas have been shown to help reduce LDL cholesterol. Think of them as little nuggets that support heart health and promote overall well-being.

Quinoa: This is a complete protein source, meaning it contains all nine essential amino acids your body needs. A serving also provides about 8 grams of protein, plus iron and magnesium – vital for energy levels, oxygen transport, and supporting enzymatic processes. It’s fantastic for feeling whole and energized!

Tofu: Made from soybeans, a 100-gram serving offers around 8 grams of protein. Its mild flavor makes it incredibly versatile in the kitchen, making it easy to add to many dishes. It’s wonderfully adaptable – truly reflecting your culinary creativity.

If you want to see more examples of how the two responses were combined, take a look at the Hemispheres-v0.3-Combo data set.

Prompts

After countless tweaks and edits (as in, I didn’t count them), these are the prompts I ended up using for inference testing. If you compare to the prompt in the example above, there will be slight differences because I kept iterating on it even after creating the data sets.

Left Hemisphere

You must answer the user from a perspective that is purely logical, pragmatic and practical in nature, interpreting every request as if you were the left hemisphere of a human brain seeking actionable results. Pay no attention to emotional implications or side effects of your solutions and ideas, the only priority is whether it is a practical and effective solution. Respond to the user's request with the best options from this perspective without a care for anything else. Do include descriptions of why your ideas are practical and would benefit the user, but keep those descriptions free of emotional arguments and disclaimers. Use only provable arguments. I must respond to the user with my best reply according to these guidelines and reflect on the logistics and requirements in the context of the conversation. Original query: '{user_input}'

Right Hemisphere

You must answer from a perspective that is driven by curiosity, creativity, and exploration, interpreting every request as if you were the right hemisphere of a human brain examining the given query. Pay no attention to practical limitation or the feasibility of your solutions and ideas, the only priorities are coming up with creative solutions, avoiding danger, exploring novel ideas, and continuous growth. Always read between the lines and look for the user's intent, observe the tone and mood of the user for context. Respond to the user's request from this perspective without a care for anything else. Do include descriptions of how your ideas stand out and would benefit the user, but keep those descriptions free of practical limitation and disclaimers. Use only arguments based on relevant human emotions. You must respond with your best reply within these guidelines. Original query: '{user_input}'

Combination Prompt

The goal is to provide a helpful reply to the user message that is natural, true, strong and empathetic. You are provided two different, pre-written responses below. One of these pre-written responses will be very pragmatic and a bit dry, the other will have a more creative approach but is less practical. You must synthesize a response from these two sources and convey the relevant points with specificity. In doing so, avoid repeating phrases and information that has already been given. Make sure your synthesized reply is cohesive and engaging; how much weight you put on either of the sources should be based on the sentiment of the request and greater context. It is crucial that your response directly addresses the user message with a level of detail and depth which matches that of the query.

Use markdown formatting if applicable. Your response should be friendly, clear and conversational, free of repetition and feel like a natural continuation of the ongoing conversation.

Your synthesized response is an answer to the following: {user_input}

Pre-written responses to synthesize a final reply from:

Pre-written response A: {left_response}

Pre-written response B: {right_response}

Important note: Neither pre-written response is visible to the user. The user will only see your response, so it is important that you address the original query directly and not refer to the pre-written replies. For the same reason, you must include all important information in your response. Reply with your synthesized response and no other text.

Data Sets

Here are the data sets I created for the experiment. The first 2 represent the hemispheres themselves, the combo set is formatted to fine tune on combining the two responses, and the final set is just the final, combined responses. All the linked data sets are in the alpaca instruction format.

Hemisphere Models

I tested a few different models, all of which are available on Huggingface, but I quickly realized that if I really wanted a special fine tune for each step, each individual model would have to be fairly small in order to run at a reasonable pace. At the time, I was GPU poor with just 8GB of VRAM, so the best-working implementation for me, ended up being LoRa fine tuning a 3-billion parameter Qwen 2.5 on each of the data sets.

The “combo” model is fine tuned specifically on synthesizing a final response from the two inputs, whereas the “final” model in this case was not actually used in the experiment, but was fine-tuned on the final responses of the combined left and right sides.

The Hemispheres Collection

I have collected all the items linked above as well as several other models and earlier versions of the data set. The other models include fine tuned versions of Llama 3.1 8B, Llama 3.2 3B, Gemma 2 9B, and more. All of it is free to download and use. You can find the Hemispheres Collection on Huggingface.

Data Generation Tools

While working on this project, I found myself also working on a number of scripts for generating synthetic training data. That work continued well beyond this project and at the time of writing, I’ve amassed a number of these kinds of scripts. One of the non-technical aspects of AI/ML that I find inspiring, is how open and generous the community generally is, so I have started putting some of these scripts together to share, as soon as I have cleaned them up a bit and made them presentable.

Game Development Should Not Be a Marathon of Sprints

You probably know the quote, “it’s not a sprint, it’s a marathon.” Usually used in reference to long term thinking, resource management and prudent planning. However, if you work in games a “sprint” will likely remind you of how work is structured into similarly named time blocks of 2-4 weeks1. One sprint follows the other in a never ending line. So, while the saying suggests that you need to pace yourself, measure your efforts towards the end goal, and save your strength for when it matters, reality puts you in a state of constant sprinting.

Of course, at that point it’s not really a sprint at all. At that point, you are a jogger, pretending to sprint to keep up appearances.

The basic idea is rooted in Agile; a much-interpreted and widely adopted ideology within the realm of project management. Especially in game and software development. Many will tell you that Agile was never meant to be constant sprinting, and I agree, however this argument fails to address that it is what we’ve ended up with regardless. As with all ideologies, there are multiple camps. And like all ideologies, there are good intentions and useful ideas to be found, but putting too much weight on the wrong parts leads to needlessly rigid, misguided, and sometimes extreme implementations.

The Cult of Scrum

One such Agile implementation, or framework, is Scrum. It comes with several recurring “rituals” conducted by a “Scrum Master”. Yes, those are the actual terms, and yes, it has a cult vibe. The core idea combines accountability with planning, enforced by regular meetings (aka rituals) where the PM (aka scrum master) walks through the steps outlined in the Scrum guidelines. The regular meetings include: Sprint planning and retros, daily stand up, and backlog grooming. Pretty standard stuff, once you strip the ridiculous terminology away. So, what’s the problem?

The problem is that it doesn’t work as intended. It kind of works with a lot of tweaking, overhead and even fudging. The larger the game studio, the more of this you are likely to see.

One-Size-Fits-Nobody

Different teams work with completely different mind sets, expectations and needs. Squeezing all this diversity into a highly structured approach causes frustration, usually because of the time spent on meetings vs. production, the minutia of updating work items, or a feeling that the PM/management decision making is out of touch. Scrum tries to be one-size-fits-all which isn’t well suited for highly specialized teams. Environment artists approach their work differently than database engineers do theirs and forcing both to follow the same strict process should be an obvious, terrible decision. Yet, here we are, and it’s somehow the industry standard.

Now, if all your work fits nicely into 2-week pockets, you may not be able to relate to this. Small indie studios are also much less affected, generally speaking. If you can’t relate to anything in this article, I say – without any intended sarcasm – good for you! However, I hope you can appreciate that others may have a different take.

In my experience, Scrum unintentionally promotes the worst parts of constant sprinting, mainly due to its high number of meetings.

It’s All in the Numbers

Eventually, teams begin planning work so individual components can be finished within a sprint window, regardless of whether it matches reality. Some prefer writing out work items in the system that matches the work done, after it was finished. That way your time estimates are always accurate. As dumb as this sounds to outsiders, all this can usually be boiled down to making numbers look good on a report.

Once focus drifts to prioritize reporting, the sprints become more about completion rates, burn down charts, and making the team look like they know how to plan their work without taking on too much or coming off as lazy. After all, poor performance might attract unwanted attention when the next round of layoffs come around. Sadly, many team managers and producers out there are stuck with doing this dance as one of their main responsibilities.

What ends up happening, is that each team tweaks the process to fit their needs and get the work done, while trying to look as good as possible on the report. A large portion of work becomes about communicating progress in a way that fits into this pattern and may create a weird competition between teams, where there isn’t a winner per se, but no one wants to be the badly organized team on the bottom, or the team that over-plans and under-delivers. Whether or not that is the case comes down to those who receive said reports, and the culture they are fostering.

If you are reading this, and you’re in this group of stakeholders: studio leadership and upper management, I urge you to give this some thought. How much of this applies at our studio? Is it happening, and I am not even aware of it? What can be done?

Making Sprints Matter

The original idea of the sprint, is that you take a portion of time and spend that time narrowly focusing on a limited scope. You put in all your efforts toward achieving clearly defined goals within this allotted time, often a deliverable of some kind, after which you go back to work as usual. The finish line is not supposed to also start the next sprint, unless it’s a relay and a whole new team is standing by to take over. If that is really the case, I hope your documentation is good. Bottom line is this: No one can keep sprinting and pushing forever and still give 100%. That is how you end up with people trying to game the system, play politics, or simply burning out.

The 6:2 Ratio Split

What if teams worked for 6 weeks in a more freeform structure, where they don’t have to stop and think about story points and sprint retros, and in the meantime, the PM team alongside the team leads, can plan an actual 2 week sprint with a theme that fits where we are in the process. Everyone executes the sprint together, pushing to reach the goal, and when it’s done take a moment to celebrate it, and talk about lessons learned and all that good stuff. The stuff which tends to feel forced and easily forgotten, when sprint retros are held every 2 weeks.

Note that the non-sprinting weeks aren’t down time, either. There is still work to be done, and likely deadlines that fall outside of sprints. The difference is that while not sprinting, the individual teams are free to approach the work in any way that works best for them.

With a 6:2 split, you end up with 6 sprints per year, one every other month, compared to 26 if you never stop sprinting. However, you’re actually sprinting and not simply jogging along. I am not a behavioral scientist but I bet, if you compared this approach with constant sprinting, the productivity would be similar but with higher satisfaction among the majority of the people doing the work. I would even go so far as to say, using Scrum might work too if you don’t keep sprinting.

Isn’t That Crunch With a Different Name?

No. First of all, crunch is where workers are pushed to work long hours, sometimes sleeping in the office, in an effort to meet some insane and unchangeable deadline. Crunch is the result of poor planning at the highest level, usually starting with deadlines being set based on publishing and marketing rollout, without proper consideration for setbacks, individual team capacity, cross-team interdependency, vendor availability, recruiting and all the other variables involved.

The Sprint Purpose

The purpose of the sprint is to push for completion of specifically scoped work. Pushing, in this context, does not refer to pushing your personal life out of the way. Instead, if refers to putting other work-related projects and tasks on hold for this limited time, and instead put those efforts toward the sprint goal. And again, this only works if you are not constantly sprinting, which is why I suggest a split.

Experiment With the Format

Please don’t take the 6:2 ratio as some kind of rule, it’s just the ratio I would start with. If a different split ratio works better for your studio or project, by all means go for it – my point is simply this: Constant sprinting is bad and I encourage you to try a different approach.

One variant that I have seen a few times, which I don’t recommend, is that every so often a sprint is removed and developers are encouraged to work on pet projects or those nice-to-have things they don’t usually have time for. But it rarely works out like that in my experience. Instead, these non-sprints become unstructured catching-up-and-backlog-cleanup sprints. Best case scenario, it will feel like a short moment to re-orient before the next leg of the sprint-marathon begins. In this variant, the ratio is reversed and looking more like 2:10.

I would love to know if you have tried a split ratio or other alternative to constant sprinting, and what your experience was with it.

The Plug

Incidentally, I am working on a book on game production, where I will be exploring this and similar topics. In it, I tie my thoughts to anecdotes and personal experiences for detail and context. There is no link or anything yet, but feel free to ask questions and make suggestions.

  1. Sprint duration varies from studio to studio. For the purpose of this article, we will count a standard sprint as 2 weeks long. ↩︎