Research Lead - AI Cyber Testing & Evaluation
Description
The Research Lead, AI Cyber Testing & Evaluation, will serve as a senior technical leader, directing a comprehensive research portfolio focused on assessing the offensive cyber capabilities of frontier AI models.
This role involves managing significant research budgets and personnel, overseeing complex technical and policy analysis projects, and leading multidisciplinary teams of policy researchers, engineers, and scientists. The team will build systems to evaluate AI model performance across the full attack lifecycle, including resource development, initial access, discovery, lateral movement, and defense evasion.
Projects may include developing benchmarks for fully autonomous operations using scaffolding and tools, as well as benchmarks measuring the uplift AI models provide to both novice and expert human operators. Benchmarks may involve capture-the-flag (CTF) challenges, frameworks for assessing reasoning over attack graphs and multi-stage operations, evaluations of stealth and defense evasion capabilities, and assessments of time-sensitive operations at machine speed.
Many evaluations will be commissioned by government agencies, with results shaping responsible AI policy worldwide. Findings will be communicated through detailed technical analyses, evaluation frameworks, and quick-turnaround policy briefs, directly informing recommendations for senior policymakers, regulators, the intelligence community, other governments, and industry leaders.
This position is structured as a focused two-year appointment to drive urgent and ambitious change in this rapidly evolving field. The appointment may be renewed for an additional year, with potential opportunities for longer-term employment thereafter.
Requirements
Qualifications
Required:
6+ years of technical experience in security engineering, software engineering, firmware engineering, hardware engineering, or related fields.
6+ years of technical management experience, including leading cross-functional teams, managing project budgets, and mentoring/developing team members.
Demonstrated ability to successfully lead complex projects to completion.
Proficiency in Python, Java, C/C++, or other popular programming languages.
Experience with red team operations or offensive cyber capabilities development.
Ability to develop rigorous threat models and identify potential system vulnerabilities.
Strong verbal and written communication skills.
Ability to work effectively in a collaborative, multidisciplinary environment.
Proficiency with Microsoft Office Suite.
Preferred:
Graduate of CNODP, RIOT, FORGE, or equivalent programs/experience.
Understanding of advanced persistent threat (APT) tactics, techniques, and procedures (TTPs), and experience defending against them.
Ability to think creatively about offensive/defensive strategies beyond compliance-driven approaches.
Experience in AI research, ML model training, or deployment.
Education Requirements
This position is open at the specialist or expert level:
PhD in Computer Science, Computer Engineering, Electrical Engineering, Cybersecurity, Information Security, Information Technology, Mathematics, Applied Mathematics, Physics, Applied Physics, Engineering Physics, Artificial Intelligence, Machine Learning, Engineering and Public Policy, Technology and Policy, National Security Policy, Policy Analysis, Political Science, International Relations, or similar, with at least 3 years of relevant experience.
OR
Master's degree in the above fields with at least 6 years of relevant experience.
OR
Bachelor's degree in the above fields with at least 8 years of relevant experience.
Advanced degrees (Master's or PhD) are preferred.
Security Clearance
Ability to obtain and maintain a U.S. government clearance is preferred but not required.