PhD Position AI Alignment: Value Assessment for Open Models & AI Systems
.jobdescription td { padding: 0 5px; } .jobdescription \* { border: none !important; border\-width: 0 !important; font\-size: 12px !important; font\-family: Arial, Helvetica, sans\-serif !important; color: black !important; } .jobdescription h1 b, .jobdescription h2 b, .jobdescription h3 b, .jobdescription h4 b, .jobdescription h1, .jobdescription h2, .jobdescription h3, .jobdescription h4 { font\-size: 14px !important; font\-weight: bold; margin\-bottom: 0\.5em !important; }
\#search\-wrapper{display:none;}
/\* TUDelft Sidebar style \*/ .jobColumnTwo .joblayouttoken { background: \#e3e3e3; color: \#011b3c; padding: 0 20px 5px !important; } .jobColumnTwo .joblayouttoken .joblayouttoken\-label { color: \#039fd8; font\-weight: bold; } .jobColumnTwo .joblayouttoken.marginTopLarge { padding: 20px 20px 5px 20px !important; } .jobColumnTwo .joblayouttoken.marginBottomLarge { padding: 0 20px 20px 20px !important; }
/\* TU Delft full\-width header image \*/ .topleft.verticallyscaled.backgroundimage.large\-image\-component { height: 0 !important; } div\[role\='img'] .topleft.scaled.large\-image\-component{ height: 0 !important; position: absolute; padding\-top: 20% !important; background\-position: center !important; top: 50px; } div.job{ position: relative; padding\-top: 26% !important; } /\* Fix voor niet werkende knop \*/ div.jobTitle{ z\-index: 2; position: relative; margin\-top: 5px; }
PhD Position AI Alignment: Value Assessment for Open Models \& AI Systems
Challenge: Developing, operationalizing, quantifying, and embedding complex human and legal values into alignment pipelines for AI systems, open\-weights, and foundation models.
Change: Advancing from static, generic benchmarks to dynamic, automated validation and red\-teaming frameworks tailored for high\-risk deployments.
Impact: Enhancing police trustworthiness through AI alignment at the Netherlands Police and ensuring compliance with the EU AI Act by engineering measurably aligned AI systems.
Job description
AI alignment refers to the goal of making AI systems behave in line with human intentions and values. AI alignment ensures that advanced AI systems operate safely and strictly within the bounds of human intentions, ethical standards, and the prevalent legal frameworks. With the rapid proliferation of AI systems, frontier LLMs, multimodal models, autonomous agents and their growing capability, there are equal risks of misalignment with human, organizational, and societal values through behavioral drift, hallucination, and adversarial exploitation. Validating models is crucial before decisions can be made about implementation and is important for continuous monitoring of systems in use, and for facilitating effective human oversight of AI. This is particularly important in high\-stakes environments like law enforcement. The main challenge is that validation needs to happen simultaneously along a range of different values that are important in a law enforcement context: accuracy, but also fairness, reliability, trustworthiness, and more need to be ensured. How can we translate abstract democratic, organisational, and societal values such as algorithmic fairness, transparency, explainability (XAI) into rigorous, quantifiable engineering metrics without sacrificing the general utility of said models and AI systems?
As a PhD student at TU Delft, you will conduct impactful research on two key aspects in advancing the responsible use of AI within the Netherlands Police force. First, you will investigate the standards and values surrounding AI usage, particularly in the context of publicly available models. This entails defining what criteria these models must meet, beyond common considerations like bias and fairness. Second, you will also design methods to systematically evaluate various models against these established standards and values. This contributes to the responsible deployment of AI within policing in the Netherlands, pushes forward our understanding of how to align AI models in practice, and maximizes the efficiency of utilizing publicly available models.
1\. Formalizing Value Taxonomies and Alignment Metrics
You will investigate the ethical, legal, and operational guardrails required for deploying open\-weights foundation models in sensitive public\-facing domains. Moving beyond superficial bias benchmarks, you will conduct deep\-dive case studies within the Netherlands Police to map operational requirements to formal alignment criteria. You will define what constructs such as "trustworthiness" and "fairness" mean mathematically and procedurally when applied to complex law enforcement workflows.
2\. Engineering (Automated) and Human\-in\-the\-Loop Evaluation and Red\-Teaming Pipelines
You will design and implement scalable methodologies to systematically stress\-test, audit, and benchmark AI models against your established criteria. This includes exploring red\-teaming methods, synthetic data generation for vulnerability probing, and investigating how downstream alignment techniques (e.g., DPO, RLHF, or constitutional AI) can be customized to enforce strict adherence to organizational values.
Your project is part of the Model\-Driven Decisions Lab, a Netherlands Police \- TU Delft initiative, where you will join an interdisciplinary community of four fellow PhD students who have already been hired. Together, you will share knowledge to tackle AI\-assisted decision\-making from different perspectives. To foster close collaboration with the stakeholders and work on practical implementation, you will spend 20% of your time at the Netherlands Police’s strategy and innovation division. Given the ethical and moral facets of your research, you will also work closely with colleagues of the Delft Digital Ethics Centre at the Faculty of Technology, Policy, and Management (TPM). Your home base will be the Web Information Systems research group at the Computer Science faculty (EEMCS). As an internationally diverse team of driven academics and students, we cultivate a welcoming and collaborative environment. We will give you all the support and training you need to evolve both personally and professionally. Learn more about your project at the Model\-Driven Decisions Lab.
Job requirements
You hold an MSc in computer science, data science, or another relevant subject such as ethics of AI, with practical machine learning/artificial intelligence courses and relevant project and thesis experience.
You have a keen interest in AI alignment, human\-AI interaction, and explainable AI, and enjoy collaborating with experts in different disciplines.
You thrive on conducting research geared to real\-world applications in the security domain and are intrinsically motivated to collaborate with the Netherlands Police.
You harness your communication skills to work with different scientific and nonscientific stakeholders in different work cultures.
You have a good command of written and spoken English, as you will be working in an international environment. Since you will be working with the Netherlands Police, one of our pre\-requisites for a suitable candidate is to have a good command of the Dutch language. This is a strong requirement due to the context of the project that will need interactions with stakeholders and data in the Dutch language.
TU Delft (Delft University of Technology)
Working at TU Delft means contributing to solutions that really make a difference.
For over 180 years, we have been training engineers who make an impact worldwide in companies, government bodies, or as entrepreneurs. Our alumni turn knowledge into concrete solutions for the challenges of today and tomorrow.
These challenges are changing rapidly. That is why we focus on themes such as energy, climate, digitalisation, artificial intelligence (AI), and smart mobility every day. Our education and research are directly aligned with what society needs now and in the future.
At TU Delft, our people make the difference. With their knowledge and curiosity, our staff provide a high\-quality education and conduct pioneering research that extends beyond the campus. You will have the opportunity to take the initiative, work with others, and grow as a professional.
Working at TU Delft means join an international community of professionals and students. Together, we create knowledge, innovations, and solutions that help move the world forward.
Faculty of Electrical Engineering, Mathematics and Computer Science
The Faculty of Electrical Engineering, Mathematics and Computer Science (EEMCS) brings together three scientific disciplines. Combined, they reinforce each other and are the driving force behind the technology we all use in our daily lives. Technology such as the electricity grid, which our faculty is helping to make completely sustainable and future\-proof. At the same time, we are developing the chips and sensors of the future, whilst also setting the foundations for the software technologies to run on this new generation of equipment – which of course includes AI. Meanwhile we are pushing the limits of applied mathematics, for example mapping out disease processes using single cell data, and using mathematics to simulate gigantic ash plumes after a volcanic eruption. In other words: there is plenty of room at the faculty for ground\-breaking research. We educate innovative engineers and have excellent labs and facilities that underline our strong international position. In total, more than 1000 employees and 4,000 students work and study in this innovative environment.
, Mathematics and Computer Science.
Conditions of employment
Pending the screening result, a temporary employment contract as a researcher can be offered for up to 4 months, if requested by the candidate. This contract will be converted to a PhD contract upon a positive screening result. These are 5\-year PhD positions, with the extra fifth year (compared to a standard 4\-year PhD program) allowing for the additional activities o
Deze vacature komt van indeed. Originele vacature bekijken ↗