Fully Funded PhD position in generative AI and expressive speech synthesis for social inclusion KTH Royal Institute of Technology

JG
Joakim Gustafsson
Wed, Oct 4, 2023 9:04 AM

The School of Electrical Engineering and Computer Science (EECS) at the KTH Royal Institute of Technology announces a fully funded Ph.D position in machine learning for conversational AI at the division of Speech, Music, and Hearing (TMH).

ABOUT KTH
KTH Royal Institute of Technology in Stockholm has grown to become one of Europe’s leading technical and engineering universities, as well as a key center of intellectual talent and innovation. We are Sweden’s largest technical research and learning institution and home to students, researchers and faculty from around the world. Our research and education covers a wide area including natural sciences and all branches of engineering, as well as in architecture, industrial management, urban planning, history and philosophy.

The doctoral position is offered by the division of Speech, Music and Hearing (https://www.speech.kth.se/). It is an internationally prominent research group within speech technology and related areas with a heritage that ranges back to the early 50s. It was one of the first research groups of its kind, and is likely the oldest that is still active. The group is in the forefront of fundamental research in areas such as speech analysis, analysis of human communicative behaviours, audiovisual synthesis and generation, speech perception and understanding, and multimodal dialogue systems.

PROJECT DESCRIPTION
The task of the candidate is to develop and evaluate the usefulness of a generative AI and expressive
speech synthesis in Augmentative Communication Technology for individuals with communication disabilities.
As the AI voice system is designed for users who input text using gaze trackers the project also involves
using Large Language models to speed up the text input given the previous context.
The system will leverage the KTH spontaneous TTS used in the video below and with more samples here :
www.speech.kth.se/tts-demos/http://www.speech.kth.se/tts-demos/

Supervision: The doctoral student will be supervised by
Joakim Gustafson (https://www.speech.kth.se/~jocke/)
and
Éva Székely (https://www.linkedin.com/in/eva-szekely-48459528a/)

The starting date for the positions is flexible, but as soon as possible.

What we offer

  • Workplace that focus on working conditions, gender equality, diversity, study environment
  • A monthly salary of around 3.000 EUR, and benefits such as 6 weeks paid vacation, health insurance, paid parental leave and occupational pensions.

QUALIFICATIONS
The candidate must have a degree in Computer Science or related fields.  In order to succeed as doctoral student at KTH you need to be goal oriented and persevering in your work. During the selection process, candidates will be assessed upon their ability to work both independently and to collaborate with others. Other important traits are openness toward interdisciplinary research, creativity and a structured approach. Good command of spoken and written English is required. Expertise in machine learning/deep learning and good programming skills are strongly desirable. Experience and interest in areas like speech technology, signal processing, natural language processing, linguistics, perception is also considered as asset. After the qualification requirements, great emphasis will be placed on personal qualities and personal suitability.

HOW TO APPLY

The application should include:

  1. Curriculum vitae.
  2. Transcripts from University/College.
  3. Brief description of why the applicant wishes to become a doctoral student.

More information here:
https://www.speech.kth.se/vacancies/

The application documents should be uploaded using the KTH's recruitment system.
https://kth.varbi.com/en/what:job/jobID:655390/type:job/where:4/apply:1

The application deadline is ** October 10, 2023 **


Joakim Gustafson
Professor, Head of Division, Co-head of Department
KTH Royal Institute of Technology
School of Electrical Engineering and Computer Science
Department of Intelligent Systems
Division of Speech, Music and Hearing (TMH)

The School of Electrical Engineering and Computer Science (EECS) at the KTH Royal Institute of Technology announces a fully funded Ph.D position in machine learning for conversational AI at the division of Speech, Music, and Hearing (TMH). ABOUT KTH KTH Royal Institute of Technology in Stockholm has grown to become one of Europe’s leading technical and engineering universities, as well as a key center of intellectual talent and innovation. We are Sweden’s largest technical research and learning institution and home to students, researchers and faculty from around the world. Our research and education covers a wide area including natural sciences and all branches of engineering, as well as in architecture, industrial management, urban planning, history and philosophy. The doctoral position is offered by the division of Speech, Music and Hearing (https://www.speech.kth.se/). It is an internationally prominent research group within speech technology and related areas with a heritage that ranges back to the early 50s. It was one of the first research groups of its kind, and is likely the oldest that is still active. The group is in the forefront of fundamental research in areas such as speech analysis, analysis of human communicative behaviours, audiovisual synthesis and generation, speech perception and understanding, and multimodal dialogue systems. PROJECT DESCRIPTION The task of the candidate is to develop and evaluate the usefulness of a generative AI and expressive speech synthesis in Augmentative Communication Technology for individuals with communication disabilities. As the AI voice system is designed for users who input text using gaze trackers the project also involves using Large Language models to speed up the text input given the previous context. The system will leverage the KTH spontaneous TTS used in the video below and with more samples here : www.speech.kth.se/tts-demos/<http://www.speech.kth.se/tts-demos/> Supervision: The doctoral student will be supervised by Joakim Gustafson (https://www.speech.kth.se/~jocke/) and Éva Székely (https://www.linkedin.com/in/eva-szekely-48459528a/) The starting date for the positions is flexible, but as soon as possible. What we offer - Workplace that focus on working conditions, gender equality, diversity, study environment - A monthly salary of around 3.000 EUR, and benefits such as 6 weeks paid vacation, health insurance, paid parental leave and occupational pensions. QUALIFICATIONS The candidate must have a degree in Computer Science or related fields. In order to succeed as doctoral student at KTH you need to be goal oriented and persevering in your work. During the selection process, candidates will be assessed upon their ability to work both independently and to collaborate with others. Other important traits are openness toward interdisciplinary research, creativity and a structured approach. Good command of spoken and written English is required. Expertise in machine learning/deep learning and good programming skills are strongly desirable. Experience and interest in areas like speech technology, signal processing, natural language processing, linguistics, perception is also considered as asset. After the qualification requirements, great emphasis will be placed on personal qualities and personal suitability. HOW TO APPLY The application should include: 1. Curriculum vitae. 2. Transcripts from University/College. 3. Brief description of why the applicant wishes to become a doctoral student. More information here: https://www.speech.kth.se/vacancies/ The application documents should be uploaded using the KTH's recruitment system. https://kth.varbi.com/en/what:job/jobID:655390/type:job/where:4/apply:1 The application deadline is ** October 10, 2023 ** ----------------- Joakim Gustafson Professor, Head of Division, Co-head of Department KTH Royal Institute of Technology School of Electrical Engineering and Computer Science Department of Intelligent Systems Division of Speech, Music and Hearing (TMH)