Project Details
Project Description
Leveraging on our on-going expertise and progress in the Trustworthy Machine Learning project with DST since 2018, this project aims to extend the scope to LLMs with a focus on their safety and reliability, materialised into four aims: (i) develop adversarial attacks and defences on LLMs using geometric AI principles, (ii) study semantic uncertainty and calibration in blackbox/whitebox LLMs to improve output worthiness , and (iii) align LLMs toward Human-AI teaming with diverse human values via multi-objective optimization; and lastly more open aim to (iv) develop multimodal foundation LLMs robustness to include vision-language, speech and other structured and unstructured modalities such as human physiological signals.
Short title | Trustworthy Generative AI |
---|---|
Acronym | TMLGenAI |
Status | Active |
Effective start/end date | 11/06/24 → 10/07/26 |
Equipment
-
MASSIVE
Powell, D. (Manager) & Tan, G. (Manager)
Office of the Vice-Provost (Research and Research Infrastructure)Facility/equipment: Facility