2023-06317 - Research Engineer - Domain Adaptation in NMT
Contract type : Fixed-term contract
Level of qualifications required : Graduate degree or equivalent
Fonction : Temporary scientific engineer
ContextThe chosen applicant will carry out research in the context of the DadaNMT Emergence project funded by Sorbonne Université. The aim of the project is to explore approaches to training MT models (i) for low-resource domains by using out-of-domain data (domain adaptation) and (ii) that are capable of handling texts from different domains (multi-domain models). We will apply our approaches across three domains: film subtitles (using different film genres), news (from different countries) and biomedical data. We will also test across different language pairs, focusing on at least the three following pairs: English-French, English-German and English-Romanian, chosen because of the availability of training and test data across the domains and because they represent three different scenarios in terms of the amount of data available. The three chosen domains have freely available parallel data available: film subtitles from OpenSubtitles (Lison et al. 2018), news data from the WMT news translation shared tasks (Barrault et al. 2019) and biomedical data from the WMT biomedical MT shared tasks (Neves et al. 2019).
AssignmentThe aim of the topic will be to explore directions for the adaptation of machine translation (MT) to low-resource domains and/or the development of MT models capable of handling several domains, while maintaining performance of the model on those domains that are well represented (i.e. avoid catastrophic forgetting). There are several possible directions that could be explored depending on the interest and experience of the candidate, including:
Curriculum learning: how to best introduce training examples such that the model can learn better (e.g. from simpler to more complex examples, transitioning from one domain to another, etc.) (Platanios et al., 2019; Zhan et al., 2021)
Meta-learning: how to best initialise a model in order for it to be able to robustly adapt to new domains, particularly those that have few training examples. (Sharaf et al., 2020; Zhan et al., 2021)
Using large-scale pretrained LLMs to see how well they can be adapted to translation for specific domains. This could either involve conversational LLMs or traditional LLMs. In the first case, potential approaches could include the decomposition of the translation of new examples into composite tasks via chain-of-thought prompting (Wei et al., 2022; Wang et al., 2022) or the training of composite tasks (Bursztyn et al., 2022) each of lower simplicity (e.g. word translation, formulation, reformulation) and also translation refinement using adapted prompts. In the second case, the approach could follow prompt selection and example selection.
References
The main activities include carrying out reading on the proposed topic, experimenting with baselines (previous work) and proposing and implementing new solutions. The work carried out will be presented both within the team and internationally (should the work be accepted as a peer-reviewed conference or workshop).
SkillsWe are looking for an applicant with a good experience in machine learning, natural language processing and a strong interest for linguistics. The applicant must have a good level of written and spoken English.
Benefits packageInria is the French national research institute dedicated to digital science and technology. It employs 2,600 people. Its 200 agile project teams, generally run jointly with academic partners, include more than 3,500 scientists and engineers working to meet the challenges of digital technology, often at the interface with other disciplines. The Institute also employs numerous talents in over forty different professions. 900 research support staff contribute to the preparation and development of scientific and entrepreneurial projects that have a worldwide impact.
Instruction to applyDefence Security : This position is likely to be situated in a restricted area (ZRR), as defined in Decree No. 2011-1425 relating to the protection of national scientific and technical potential (PPST).Authorisation to enter an area is granted by the director of the unit, following a favourable Ministerial decision, as defined in the decree of 3 July 2012 relating to the PPST. An unfavourable Ministerial decision in respect of a position situated in a ZRR would result in the cancellation of the appointment.
Recruitment Policy : As part of its diversity policy, all Inria positions are accessible to people with disabilities.
Warning : you must enter your e-mail address in order to save your application to Inria. Applications must be submitted online on the Inria website. Processing of applications sent from other channels is not guaranteed.