An Efficient and Streaming Audio Visual Active Speaker Detection System – Apple Machine Learning Research
An Efficient and Streaming Audio Visual Active Speaker Detection System – Apple Machine Learning Research