[Project-Notes] Scribe - STT Editor

I’m currently working with Automatic Speech Recognition models as part of my thesis. I thought it would be fun to create an editor front end to work with these models, as I read more of them. Additionally, this would be a good opportunity to test AI Coding tools. For this I’m using OpenCode along with it’s free models.

The way I would like to record this is, for each session and update, the goal, surprises by the model, some parts of code that I found interesting, or not good as well as updates made to the prompts.

Note: I began this without knowing a lot about “Agentic Coding”, Agents.md, skills etc. Additionally, I have not yet made the code public, as I admittedly, need to review lot’s of code, check for any personal info leaks, if any at all and restructure it to be more concise.

Long term goal: Nothing concrete, but the direction is to create something useful and beautiful like Obsidian . For personal use.

Log #1:

Additional context, at this point, I’m not familiar with Web-Developement, and have a basic understanding of HTML/CSS. Very limited experience using Javascript. Another reason why I’ve chosen to not open source the code just yet. (Note if the video below isn’t showing up- click here)

For starters there were a couple of things I decided:

I already had a vision of how things should look like from some editors I’ve found in different projects by chance (I will credit/link them once I remember). The basic design initially was to setup a three column layout:

Additionally, I wanted to playback the audio at precise word that the model would be unsure of, or rather places where I think seemed unusual. (An example of this is in the video).

The positively surprised by the following: