As some of you may be aware, I’ve been a huge lover of text to speech (TTS) content for a long while, having first stumbled across TTS videos on YouTube in 2013. I can’t quite remember the first TTS video I watched, other than the fact that it featured Microsoft Sam. This was before I’d officially entered the world of smartphones and got a computer of my own, so I was confined to an Xbox with no screen reader for browsing YouTube.

At the very end of 2020, I decided to do what I’d been wanting to do for years and become a member of the TTS community (or TTSC for short), with the goal of eventually making my own TTS content. After all, I’m not the only blind person out there who’s into TTS. Chris Wright, formerly known as BlindGamer95, also creates TTS videos. It is worth noting, however, that because I’m without any eyesight, my content will be quite different to that of creators such as Thunderbirds101 and AT88TV, both of whom I mentioned in my post “5 people I’d like to get to know in the Text to Speech community”. This post will explain exactly how my TTS content will work. That way, if I get a comment like: “Hey, why doesn’t this video have any images like a proper TTS video should?”, I can just point the commenter to this post.

So, let’s get going!

Audio only

There is next to nothing in the way of screen reader-accessible video editors out there. Not within my price range, anyway; I have basically no money right now. That’s why, unfortunately, my text to speech content will contain no images or other visual effects. Normally, in something like an error video, the audio of a TTS voice reading the error will be synced with an image of the error, generated by a tool such as Atom Smasher’s Error Message Generator. The same goes for funny/weird sign videos. However, for the reason given above, none of that is possible for me. Instead, you will only hear the voice reading the error or sign.

Sounds before errors

Normally in TTS videos, sounds aren’t cued unless a TTS voice malfunctions after raging at an error or there’s an OS switch, such as upgrading from Windows 95 to Windows 98 or downgrading from Windows XP to Windows 2000. In the case of the former, the error sound of the operating system related to the error, such as Windows 95’s chord sound, will be cued. In the case of the latter, the shutdown sound of the current operating system will be cued, followed by the startup sound of the new operating system and its login sound, if applicable.

However, I’ll have to make up for my lack of visuals with extra audio. Therefore, if I’m doing an error video, a single OS-related sound will be played before the error is read. For instance, if the operating system currently in use is Windows XP, the critical stop sound will play. If it’s Windows 2000, 98 or 95, the chord sound will play. If Microsoft Sam should rage so hard that he flies off the face of the planet, either a small section of a sound will be looped to create a glitch/computer crash effect, or a massive explosion sound will play. It all depends on what best fits the mood.
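For anyone who thinks in code, the sound rules in this section can be sketched roughly like this. It’s only an illustration of the rules as I’ve described them, not a tool I actually use: the names `ERROR_SOUNDS`, `cue_before_error` and `cues_for_os_switch` are made up for this example, and the cue labels are plain text descriptions rather than real sound files.

```python
# A rough sketch of the cue rules described above.
# All names here are hypothetical; the cue labels are descriptions, not file names.

# Which sound plays before an error is read, per operating system.
ERROR_SOUNDS = {
    "Windows XP": "critical stop",
    "Windows 2000": "chord",
    "Windows 98": "chord",
    "Windows 95": "chord",
}


def cue_before_error(current_os):
    """Return the single OS-related sound played before the error is read."""
    return ERROR_SOUNDS[current_os]


def cues_for_os_switch(old_os, new_os, has_login_sound=False):
    """Return the sounds cued, in order, when the operating system changes."""
    cues = [f"{old_os} shutdown", f"{new_os} startup"]
    if has_login_sound:
        # The login sound only applies to operating systems that have one.
        cues.append(f"{new_os} login")
    return cues
```

So an error on Windows XP would be looked up with `cue_before_error("Windows XP")`, and an upgrade from Windows 95 to Windows 98 with `cues_for_os_switch("Windows 95", "Windows 98")`. The glitch loop and the explosion are left out on purpose, since which one plays depends on the mood rather than on a fixed rule.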

I haven’t yet figured out what I’m going to do for stuff like sign videos, Christmas specials, etc., so if you have any ideas, feel free to let me know in the comment section below this post.


I hope this post helps you understand how, and why, my text to speech content will be different to that of other creators in the TTSC. The reason I’ve been putting off doing TTS content for so long, besides not having the knowledge or tools to do so years ago, is that I’ve always been afraid that the TTSC would simply cast me out or think nothing of me for not being able to meet their standards and expectations.

If you want to know when I start making TTS content, which will hopefully be very soon, you can subscribe to my YouTube channel and follow me on Twitter.