OPEN JOB POSITION - POST-DOCTORAL SCHOLARSHIP ON COMPUTER VISION AND NATURAL LANGUAGE PROCESSING - USP - SÃO PAULO - BRAZIL

 

Knowledge area: Computer Science / Computer vision

Project Title: New methods for image captioning: a framework based on computer vision and natural language processing 

Specific area: Computer vision

Job start: November 2022

Supervisor: Roberto M. Cesar Jr.

Institution: University of São Paulo  - USP

AddressRua do Matão, 1010, Cidade Universitária, São Paulo, Brasil.

Deadline for application: August 26st 2022

 

Inscription e-mail: thematic.data.science.usp@gmail.com

Subject: Application to POST-DOCTORAL SCHOLARSHIP ON COMPUTER VISION AND NLP

 

Description of the project

 

Scene description is a process that aims to associate one or more textual sentences to an image. Concerning urban infrastructure, this is a technique that allows the description of urban scenarios, such as the characterization of sidewalks in terms of size, the identification of objects and the positional relationship they have with each other, among others. In recent years important advances have been achieved in this task through the use of deep neural network techniques. These advances are facilitated by the greater availability of GPUs and large data sets.

 

Furthermore, advances achieved by neural networks have been mainly obtained in individual modalities such as vision, language or sound. In many cases, real-world problems have components that are embedded in more than one modality - as can be the scenarios of Urban Informatics.

 

In this sense, this project aims at developing computational methods to improve the processes of description of urban scenes. As object of study, we intend to explore remote-sensing and street-level images. The project will involve the use of computer vision techniques associated with natural language processing. The main idea is to use the textual language framework to improve the generated descriptions.

 

About the institution

The University of São Paulo is one of the best ranked Universities in Latin America, being considered one of country's more prestigious educational institutions. The Data Science Group at IME-USP is a traditional machine learning research working on the field for more than 20 years with strong international collaboration.

 

The selected candidate will be funded by a FAPESP fellowship with the one of the following conditions:  Initial funding of 1 year, fellowship R$ 90K per year (approx. US$ 24K / year PPP) plus overhead for travel expenses such as attending to conferences.  More information is available at http://www.fapesp.br/en/5427 The grant may also cover expenses for moving to São Paulo/Brasil (including flight tickets).

 

Required Skills

PhD degree with strong background in mathematical modelling and programming (e.g. Computer Science, Engineering, Physics, Math). Research experience and publications in one or more of the following areas: image processing, computer vision and natural language processing. Oral and written communications skills (English).

 

Application details

Please send the following documents to:

 

Inscription e-mail: thematic.data.science.usp@gmail.com

Subject: Application to POST-DOCTORAL SCHOLARSHIP ON COMPUTER VISION AND NLP

- CV

- Link to ORCID, Google Scholar, or ResearcherID

- Summary of doctoral thesis and other relevant works

- Two recommendation letters from former supervisors or collaborators.

 

 

References

 

[1] Simao Herdade et al., Image captioning: Transforming objects into words. Em: Advances in Neural Information Processing Systems, 32 (2019).

 

[2] Aishwarya Kamath et al., MDETR - Modulated Detection for End-to-End Multi-Modal Understanding. Em: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). Out. de 2021, pp. 1780– 1790.

 

[3] Duo Wang e Salah Karout, Fine-grained Multi-Modal Self-Supervised Learning. Em: arXiv preprint arXiv:2112.12182 (2021).