Meta, London, England, United Kingdom, is looking for a Research Scientist Intern, Speech & Audio Technologies (PhD).
About the job
We are looking for Research Scientist Interns to join the Meta AI Speech team in London. Our team creates spoken language technologies that make it faster and easier for people to build community and connect with others around the world. We conduct product-motivated research in ML/AI, and we design, develop, and deploy state-of-the-art algorithms for the rest of Meta. We work on all aspects of AI for speech and audio processing, including speech recognition, speech synthesis, speaker identification, keyword spotting, and acoustic event detection, with an emphasis on multimodal understanding, i.e., augmenting acoustic information with visual cues or cues from other sensors available on AR devices. Our work focuses largely on voice interfaces, including speech technologies for Ray-Ban Meta smart glasses, Quest 3 mixed-reality headsets, augmented reality, and the Metaverse, and on understanding video on Facebook and Instagram, including transcription, captioning, and content understanding. As a Research Scientist Intern, you will help us develop innovative models and algorithms and apply them to large-scale production speech tasks. Our teams at Meta AI offer internships lasting twelve (12) to twenty-four (24) weeks, with various start dates throughout the year. Learn more about our research here.
Research Scientist Intern, Speech & Audio Technologies (PhD) Responsibilities:
- Perform research to advance the science and technology of intelligent machines.
- Develop novel and accurate speech algorithms and systems, leveraging deep learning and machine learning on big data resources.
- Contribute research that can be applied to Meta product development.
- Analyze and improve the efficiency, scalability, and stability of various deployed systems.
- Collaborate with team members from prototyping to production.
Minimum Qualifications:
- Currently has or is in the process of obtaining a PhD degree.
- Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment.
- Experience in C/C++ and Python.
- Experience with deep learning frameworks such as PyTorch and TensorFlow.
- Research and/or work experience in machine learning, deep learning, and/or speech technology.
Preferred Qualifications:
- Experience manipulating and analyzing complex, high-volume, high-dimensional data from varying sources.
- Proven track record of achieving results as demonstrated by grants, fellowships, and patents, as well as first-authored publications at workshops or conferences such as Interspeech, ICASSP, or similar.
- A strong interest in theoretical and empirical research and in answering hard questions with research.
- Interpersonal experience: cross-group and cross-culture collaboration.
- Ability to stay in touch with the literature of a particular domain and reproduce results if needed.
- Experience training deep neural networks for key speech tasks such as speech recognition, speech synthesis, speech translation, speaker diarization, sentiment analysis, acoustic event recognition, scene understanding, and wake-word detection.
- Experience working with other modalities such as vision and text understanding is a plus.
- Intent to return to the degree program after the completion of the internship/co-op.