PhD Position in Machine Learning (Document Image Analysis)

Topic: Integration of Visual Aspects of Documents into Large Language Models
Placement: Institute of Artificial Intelligence and Complex Systems (iCoSys), HEIA Fribourg, Switzerland
Starting date: June 1, 2024 (or by arrangement)
Application deadline: April 15, 2024

Project description: Large language models (LLMs) have high potential for analyzing, recognizing, and validating scanned documents. However, they focus mainly on the OCR text and do not take into account visual aspects, such as layout and illustrations, which are of fundamental importance for document understanding.

The successful candidate will perform basic research and develop novel methods for the efficient integration of visual aspects into LLMs for document understanding. A particular focus will be on obtaining explainable results with respect to both the visual and the textual content of the documents.

Your profile:

  1. Master of Science in Computer Science (completed or close to completion)
  2. Strong background in machine learning
  3. Very good programming skills
  4. Very good oral and written communication skills in English; French and/or German are a plus but not a requirement

Application: Please provide the following items in a single PDF file:

  1. Motivation letter (max 1 page)
  2. CV
  3. Contact information for one reference person (a letter of reference may be included)
  4. Download link to your Master's thesis (can be a draft version if not finished yet)

Submit your application to Prof. Dr. Andreas Fischer: