via ats_greenhouse · 11 June 2026 · 2 days ago

Co-Op, Data Extraction

LILA Sciences
Cambridge

<p><strong>Your Impact at LILA</strong></p>
<p>Lila Sciences builds AI systems that accelerate discovery across the physical and life sciences. Within Physical Sciences AI, our team works on turning unstructured scientific knowledge (e.g., literature, patents, technical reports) into structured signals that power downstream Lila applications. As a Data Extraction Co-Op, you will work alongside research scientists and engineers on a focused sub-problem in this stack. You will get hands-on experience fine-tuning and evaluating extraction models, building pipelines for messy real-world data, and shipping work that flows into production systems.</p>
<p><strong>What You'll Be Building</strong></p>
<ul>
<li>Contribute to AI systems that extract and structure knowledge from scientific literature and patents, focused on a well-defined sub-problem</li>
<li>Fine-tune and evaluate language, multimodal, or specialized models for data extraction, with mentor guidance</li>
<li>Build and test pipelines that structure unstructured scientific data across text, tables, and visuals</li>
<li>Run extraction pipelines, analyze results, and document findings clearly</li>
<li>Share your work through a team presentation, write-up, or contribution to a publication or open-source project</li>
</ul>
<p><strong>What You'll Need to Succeed</strong></p>
<ul>
<li>Pursuing a Bachelor's, Master's, or PhD in Computer Science, Chemistry, Materials Science, or a related field</li>
<li>Solid foundation in machine learning fundamentals and Python</li>
<li>Familiarity with NLP or computer vision concepts</li>
<li>Curiosity about scientific data and willingness to learn quickly in a research setting</li>
</ul>
<p><strong>Bonus Points For</strong></p>
<ul>
<li>Coursework or projects involving multimodal models or document understanding (OCR, table/figure extraction)</li>
<li>Experience working with messy, real-world datasets</li>
<li>Interest in scientific document parsing</li>
</ul><div class="content-conclusion"><p><strong>About LILA</strong></p>
<p>Lila Sciences is building Scientific Superintelligence™ to solve humankind's greatest challenges. We believe science is the most inspiring frontier for AI. Rather than hard-coding expert knowledge into tools, LILA builds systems that can learn for themselves.</p>
<p>LILA combines advanced AI models with proprietary AI Science Factory™ instruments into an operating system for science that executes the entire scientific method autonomously, accelerating discovery at unprecedented speed, scale, and impact across medicine, materials, and energy. Learn more at www.lila.ai.</p>
<p>Guided by our core values of truth, trust, curiosity, grit, and velocity, we move with startup speed while tackling problems of historic importance. If this sounds like an environment you'd love to work in, even if you don't meet every qualification listed above, we encourage you to apply.</p>
<p><strong>We’re All In</strong></p>
<p>Lila Sciences is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.</p>
<p>Information you provide during your application process will be handled in accordance with our <a href="https://www.lila.ai/candidate-privacy-policy-notice">Candidate Privacy Policy</a>.</p>
<p><strong>A Note to Agencies</strong></p>
<p><em>Lila Sciences does not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to Lila Sciences or its employees is strictly prohibited unless contacted directly by Lila Sciences’ internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Lila Sciences, and Lila Sciences will not owe any referral or other fees with respect thereto.</em></p></div>

