Are currently pursuing a PhD in Computer Science, Machine Learning, AI, or a closely related field, with active research in LLMs, agents, reinforcement learning……
Develop benchmark tools and performance optimization of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-……
Collaborating with engineers to develop, evaluate, and optimize software performance in large GPU clusters. Current enrollment in a Bachelor’s, Master’s, or PhD……
Our team interacts with OS container technologies, GPU compute, and systems specialists to architect, develop and bring up large scale performance software……
Knowledge of Cloud-based technologies (e.g. AWS, Azure, Google Cloud); certification is a plus. During this program the service member will be on-site at his or……
Experience with evaluation of LLMs or agents, including hallucination analysis, benchmark design, tool-use evaluation, prompt-injection testing, red teaming, or……
Currently enrolled in a Master’s or PhD program in Computer Science, Artificial Intelligence, Data Science, Knowledge Engineering, Information Science, or a……
Currently enrolled in a Master’s or PhD program in Computer Science, Electrical Engineering, Computer Engineering, Software Engineering, Networking, Cyber-……
Selected interns for this paid opportunity will be provided with the opportunity to mentor with experienced professionals, gain experience and establish a name……
Currently has, or is in the process of obtaining, a PhD degree in EE/CS, Applied Math or a related STEM field. Design of user studies and experiments.…
Support a variety of engineering tasks with the goal to develop technical, social, and ethical skills. Assist with project research, field work, and preliminary……
Support a variety of engineering tasks with the goal to develop technical, social, and ethical skills. Assist with project research, field work, and preliminary……
Support a variety of engineering tasks with the goal to develop technical, social, and ethical skills. Assist with project research, field work, and preliminary……
Currently pursuing a PhD in Computer Science or a related technical discipline. Demonstrated software engineering experience from previous internship, work……
PhD degree in computer science or related technical disciplines. Communicate cross-functionally across various teams, organizations and internal and external……
Mentor interns and junior researchers to develop technical growth within the team. This role combines hands-on software engineering with applied research in……
Experience with magic state preparation, distillation, or factory design, including decoding and performance analysis of magic state factories.…
PhD, Master’s, or Bachelor's degree in Computer Science, Applied Math, or related science or engineering field of study (or equivalent experience).…
PhD or MSc degree in Computer Science, Computational Science and Engineering, Applied Mathematics, or related science or engineering field (or equivalent……
PhD with 1+ year, or MS (or equivalent experience) with 5+ years, of relevant experience in Computer Science, Computer Engineering, or a related technical field……
We are seeking candidates with a proven track record of research excellence, systems-building experience, a broad perspective across the field of system……
As a member of the team, you will develop new agentic AI solutions to accelerate software design, code generation, performance improvement, testing, and every……
Use AI to find out how well the skills on your resume fit this job description.
Basic Information
Job Name
Applied Research PhD Intern - AgenticAI
Country
United States
State
NA
Date Published
07-May-2026
Job ID
46802
This role can be based remotely in United States
Description and Requirements
#LI-BS1
Hybrid: #LI-Hybrid
BMC empowers nearly 80% of the Forbes Global 100 to accelerate business value, faster than humanly possible. Our industry-leading portfolio unlocks human and machine potential to drive business growth, innovation, and sustainable success. BMC does this in a simple and optimized way by connecting people, systems, and data that power the world’s largest organizations so they can seize a competitive advantage.
BMC Software runs the systems that the world's largest enterprises depend on — mainframes, automation, and the control plane underneath them. Putting agentic AI into that environment raises the bar: every agent's action must be grounded, auditable, and reversible. The Office of the CTO is working on the AI Foundation that makes this possible across BMC's product lines, and the heart of it is an Enterprise Agent Gym — the evaluation harness and experimentation loop that turns “the prototype worked” into “the agent is safe to promote to production.”
Here is how, through this role, you will contribute to BMC’s and your own success:
Work directly with members of Technical Staff in the Office of the CTO, on the evals and experimentation layer
that BMC AI products are built on.
Design evaluations that catch the failure modes of enterprise agents: hallucinated tool calls, policy violations, context collapse, regression under distribution shift, etc.
Build the Agent Gym — task definitions, graders, reward signals, and trajectory capture — for multi-step agentic workflows.
Run experimentation sweeps across prompts, models, and scaffolds; quantify trade-offs between accuracy, cost, and latency.
Turn eval results into promotion gates and readiness reports that product teams can act on.
Contribute to our Responsible AI tooling — grounding checks, policy enforcement, and human-in-the-loop escalation paths.
What you'll take on
Your project will be part of the BMC AI Foundation’s active workstreams and shaped as a focused PhD-level research internship: Agent Gym evaluations, grader design, experimentation tooling, dataset curation, or trace / replay infrastructure. Exact scope is matched to your doctoral research strengths during onboarding, with your technical mentors, and is sized to produce a concrete research artifact, prototype, or evaluation result within 12 weeks.
We think you'll do well if you
Are currently pursuing a PhD in Computer Science, Machine Learning, AI, or a closely related field, with active research in LLMs, agents, reinforcement learning, AI safety, or evaluation methodology.
Have produced non-trivial research or systems that work on modern LLM and agent stacks — multi-step tool-using agents, RAG pipelines, evaluation harnesses, and post-training.
Can turn an open research question into testable hypotheses, choose strong baselines and ablations, interpret learning curves or reward trajectories honestly, and communicate findings clearly.
Treat evaluation as a first-class AI research and engineering problem, not just a reporting layer.
Especially strong signals
Published, submitted, or in-progress PhD research on LLM evaluation, agent benchmarks, alignment, RL environments, or related systems.
Hands-on research experience with RLHF / RLVR, reward modeling, synthetic data generation, red-teaming, or scalable evaluation design.
Contributions to open-source eval harnesses, agent scaffolds, observability tooling, or reproducible research infrastructure.
Clear thinking about AI safety, deployment risk, benchmark validity, and the gap between academic results and enterprise production use.
#LI-Remote
CA-DNP
Our commitment to you!
BMC’s culture is built around its people. We have 6000+ brilliant minds working together across the globe. You won’t be known just by your employee number, but for your true authentic self. BMC lets you be YOU!
If after reading the above, You’re unsure if you meet the qualifications of this role but are deeply excited about BMC and this team, we still encourage you to apply! We want to attract talents from diverse backgrounds and experience to ensure we face the world together with the best ideas!
BMC is committed to equal opportunity employment regardless of race, age, sex, creed, color, religion, citizenship status, sexual orientation, gender, gender expression, gender identity, national origin, disability, marital status, pregnancy, disabled veteran or status as a protected veteran. If you need a reasonable accommodation for any part of the application and hiring process, visit the accommodation request page.
BMC Software maintains a strict policy of not requesting any form of payment in exchange for employment opportunities, upholding a fair and ethical hiring process.
The minimum salary is $74K and the max salary is $123K.
$74K – $123K/yr (Employer provided)
$98K
/yr Median
New York, NY
If an employer includes a salary or salary range on their job, we display it as "Employer Provided". If a job has no salary data, Glassdoor displays a "Glassdoor Estimate" if available. To learn more about "Glassdoor Estimates," see our FAQ page.
Working here doesn’t have to be a secret
Sign in to browse authentic reviews, anonymous ratings and salary data before you apply.