Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
4.2k views
in Technique[技术] by (71.8m points)

How to stream audio from microphone to Google Speech-to-Text in Python using WebRTC

I'm looking for documentation about transcribing audio streaming data coming from WebRTC using Google Cloud Speach-To-Text. I'm using aiortc as a library in Python to handle the video and audio stream coming from a client web app.

Here is a snippet of the class that I'm using to process the audio data.

class AudioTransformTrack(MediaStreamTrack):
        kind = "audio"
    
        def __init__(self, track):
            super().__init__()
            self.track = track
    
        async def recv(self):
            frame = await self.track.recv()
            data_np = frame.to_ndarray().astype(dtype='float32').reshape(1920, )
            # print("data_np.shape:", data_np.shape)
            y_16k = librosa.resample(data_np, 48000, 16000)
            audio_data = y_16k.astype(dtype='int16').tobytes()
            return frame

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)
等待大神答复

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...