data:image/s3,"s3://crabby-images/26c2f/26c2fdc588a11a8b121fe7a3c308e9fe2b4194ea" alt="Ffmpeg extract audio from webm"
This API will be making use of FFmpeg to do the actual conversion itself. Simply put, we’re going to make use of an Azure function to build a simple API, which will do the work of converting a WEBM file to a WAV file for us.We’re going to use FFmpeg to convert the Microphone Audio in WEBM format to an audio file in WAV format, so we can pass that file to The Azure Speech to Text Cognitive Services.WEBM is not a supported format for Azure cognitive services. This is because, the Azure cognitive service only recognizes audio which is either in WAV or OGG formats.When we try to pass this audio recorded in WEBM format to Azure cognitive services so as to get it converted from Speech to Text we get an error as Unsupported File Format.Whenever we record the audio in the Microphone control of Power Apps it always gets recorded in the WEBM format.Converting the HTML file to PDF (most widely used document format for business processes).Creating HTML file with the text obtained from output of Cognitive services post conversion.Creating a Speech file in SharePoint which has the audio recorded in Microphone.In addition to this we are going to add more power to the Power Automate solution by also. The main focus of this blog however is to understand how to design and configure a Power Automate solution to bring about conversion of Speech(Audio in Microphone) to Text with additional capabilities like creating an HTML file with the text obtained post conversion. Create a Power Automate (Flow) to create an HTML file, using the text obtained from the output of th.Create an Azure Function to convert audio captured in Power Apps from WEBM to WAV format using FFmpe.Design a Canvas App with The Microphone Control to capture Audio.Since I had to work with a number of components like Power Apps, Power Automate, Azure function, FFMpeg codec and Azure cognitive services to create this solution, I had to divide my blog into 3 parts.
data:image/s3,"s3://crabby-images/92744/9274485aee41a67bc542697d7255209832517b99" alt="ffmpeg extract audio from webm ffmpeg extract audio from webm"
data:image/s3,"s3://crabby-images/783cb/783cb2f733ef4fac1520ac43904897432dbbac00" alt="ffmpeg extract audio from webm ffmpeg extract audio from webm"
data:image/s3,"s3://crabby-images/fed08/fed084a34492d46fc0e25a9ac857b54be1b917dd" alt="ffmpeg extract audio from webm ffmpeg extract audio from webm"
Recently I wrote a nice blog here on how to Convert the Audio recorded from Microphone Control of Power Apps to Text by configuring a Power Automate solution which consumes Azure Cognitive Services.
data:image/s3,"s3://crabby-images/26c2f/26c2fdc588a11a8b121fe7a3c308e9fe2b4194ea" alt="Ffmpeg extract audio from webm"