“Think before you speak. Read before you think.”
Here I am again waxing poetic (check out the video) about artificial intelligence and video analysis. This time around, I am exploring how AI can be used to create smart transcriptions. By “smart” I mean transcriptions that contain not only what was said, but also who said it and when. This becomes very useful with long meeting recordings where you are only interested in certain subjects discussed by certain people. Rather than having to watch or scan through the entire recording, smart transcriptions let you jump straight to the parts that interest you.
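To make the idea concrete, here is a minimal Python sketch of what a "smart" transcript might look like as data, and how you could jump to the parts you care about. The `Utterance` class, the `find_segments` helper, the speaker labels, and the sample sentences are all hypothetical names and data made up for illustration; they are not taken from the demo.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    """One stretch of speech: who said it, when, and what was said."""
    speaker: str   # hypothetical speaker label, e.g. "spk_0"
    start: float   # seconds from the start of the recording
    end: float     # seconds from the start of the recording
    text: str


def find_segments(utterances, speaker=None, keyword=None):
    """Return the utterances that match an optional speaker and keyword."""
    matches = []
    for u in utterances:
        if speaker is not None and u.speaker != speaker:
            continue
        if keyword is not None and keyword.lower() not in u.text.lower():
            continue
        matches.append(u)
    return matches


# Made-up sample data standing in for a real meeting transcript.
transcript = [
    Utterance("spk_0", 0.0, 4.2, "Let's review the budget for next quarter."),
    Utterance("spk_1", 4.5, 9.1, "The budget looks fine, but hiring is behind."),
    Utterance("spk_0", 9.4, 12.0, "Then let's talk about hiring next."),
]

# Show only what spk_1 said about the budget, with timestamps to seek to.
for u in find_segments(transcript, speaker="spk_1", keyword="budget"):
    print(f"{u.start:.1f}s-{u.end:.1f}s {u.speaker}: {u.text}")
```

The start and end times are what make the transcript useful for navigation: a player could seek directly to `start` instead of making you scrub through the whole recording.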
My demo solution consists of several components. I have a desktop application that watches a folder for new video files. When one appears, the application sends it off to the AWS AI Transcription engine for processing. Once the engine has accepted the file, it returns a job ID. The processing takes some time, so rather than sitting around waiting, I developed a second application that runs when the transcription job is ready. That application runs on my public Linux server and is kicked off by a webhook invocation. The invocation contains a reference to the completed job ID, which tells me where to find the transcription results. Those results are then examined and the relevant information is sent to an Avaya Spaces room. A “real solution” would connect the results to the video file, but that’s more work than I am ready to take on for this blog article.
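The "results are then examined" step can be sketched as follows. This is not the code from my demo, only a minimal illustration of turning word-level results into speaker turns. The item layout (`type`, `speaker_label`, `start_time`, `end_time`, `alternatives`) is an assumption modeled on typical speaker-labeled transcription output, so check it against the JSON your engine actually returns. The function name `group_into_turns` and the sample items are hypothetical.

```python
def group_into_turns(items):
    """Merge consecutive words from the same speaker into one turn.

    Assumes each item is a dict shaped like the sample below. Punctuation
    items carry no timestamps, so they are attached to the previous word.
    """
    turns = []
    for item in items:
        content = item["alternatives"][0]["content"]
        if item["type"] == "punctuation":
            if turns:
                turns[-1]["text"] += content
            continue
        speaker = item["speaker_label"]
        start = float(item["start_time"])
        end = float(item["end_time"])
        if turns and turns[-1]["speaker"] == speaker:
            # Same speaker is still talking: extend the current turn.
            turns[-1]["text"] += " " + content
            turns[-1]["end"] = end
        else:
            # A different speaker started talking: open a new turn.
            turns.append(
                {"speaker": speaker, "start": start, "end": end, "text": content}
            )
    return turns


# Made-up sample items in the assumed layout.
items = [
    {"type": "pronunciation", "speaker_label": "spk_0",
     "start_time": "0.0", "end_time": "0.4",
     "alternatives": [{"content": "Hello"}]},
    {"type": "punctuation", "alternatives": [{"content": ","}]},
    {"type": "pronunciation", "speaker_label": "spk_0",
     "start_time": "0.5", "end_time": "0.9",
     "alternatives": [{"content": "everyone"}]},
    {"type": "punctuation", "alternatives": [{"content": "."}]},
    {"type": "pronunciation", "speaker_label": "spk_1",
     "start_time": "1.2", "end_time": "1.5",
     "alternatives": [{"content": "Hi"}]},
]

turns = group_into_turns(items)
for turn in turns:
    print(f'{turn["start"]:.1f}s {turn["speaker"]}: {turn["text"]}')
```

Each resulting turn carries a speaker, a start time, and the text, which is the "relevant information" that could then be posted to a collaboration room as a message.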
To see all of the above in action, please take a look at my latest Cheapo-Cheapo Productions video:
My interest in AI is taking me to some very interesting places. What began as simple image analysis has blossomed into exploring solutions that use AI to solve common and very real problems.
These endeavors also reinforce my belief in cloud services and composable solutions. While I had to do some significant coding to pull these demos together, developing my own AI engine and collaboration platform are undertakings far beyond my humble abilities. Thankfully, there are smart companies out there that understand the need to wrap their products in publicly accessible APIs. So, rather than reinvent wheel after wheel, I take what I need to create my own unique offerings.
As always, thank you for reading and watching. Feel free to reach out to me if you want to dig deeper. I am always happy to speak geek.