Uni Internship Jan to May 2025 - Development of Large Language Models (LLMs) for healthcare use-case

Date: 13 Sep 2024

Location: SG

Company: Synapxe

Synapxe is the national HealthTech agency inspiring tomorrow’s health. The nexus of HealthTech, we connect people and systems to power a healthier Singapore. Together with partners, we create intelligent technological solutions to improve the health of millions of people every day, everywhere.
 
Are you someone who enjoys problem solving, has a creative and curious mind, and strives to create a better and healthier tomorrow? If you answered yes to all of these, do check out our website to find out more about Internship@Synapxe.
 
Join Synapxe as an intern and see how you can contribute to powering a healthier Singapore. We aim to deliver the best experience for all interns, helping you grow rapidly and pave your way into the tech industry.

 

While remarkable progress has been made in Natural Language Processing (NLP), the full potential of LLMs in healthcare remains untapped. Models such as GPT-4 or Llama 2 have demonstrated their efficacy in various NLP tasks across industries, but their adoption in the healthcare domain has been limited by the scarcity of specialized medical data for training, challenges in truth verification, and the potential consequences of errors. To address this gap, it is crucial to swiftly prototype and operationalize LLMs in the healthcare sector to maximize their benefits.

 

The objective of the project is to implement end-to-end pipelines that harness the power of open-source LLMs for healthcare use-cases as a proof-of-concept (POC). Possible use-cases include classification, summarization, and question answering. The pipelines may include some of the following steps: data preparation, prompt engineering, model fine-tuning, and model evaluation.

 

- Publicly available data will be downloaded and used for the project.

- The candidate will apply text processing and data exploration techniques to get acquainted with the dataset, and will create a data processing pipeline that prepares the data in a format suitable for model ingestion.

- The candidate will implement prompt engineering and/or fine-tuning techniques to adapt LLMs to healthcare-specific NLP tasks.

- By the end of the project, the candidate will be familiar with basic and advanced NLP techniques as well as state-of-the-art LLM implementation and development for specific use-cases.

 

Note: The scope of the project may change depending on company priorities. In addition, the student may be asked to contribute to and support additional ongoing projects and duties on an on-demand basis.


About you:

  • Pursuing a Bachelor's degree in Business Analytics, Data Science, Computer Engineering, Computer Science or a related discipline
  • Graduating in May/Dec 2025 or May 2026
  • Strong coding skills in Python for data processing and model development are required
  • Knowledge and prior experience with deep learning algorithms is a plus
  • Experience with one or more deep learning frameworks, e.g., TensorFlow, PyTorch, is preferred
  • Familiarity with Git, GitHub repositories and object-oriented programming is desirable
  • Ability to communicate effectively and present results and findings
  • Ability to multitask and work effectively as part of a multidisciplinary team
  • Ability to document internship project materials comprehensively and rigorously, including literature articles, code, results, findings, and slides
  • Passionate about making a difference and keen to re-imagine the future of HealthTech

 

The intern's work location will be at 1 Maritime Square, #12-01 Harbourfront Centre, Singapore 099253.

 
