Lead Data Scientist - Gen AI & Digital Twin

Other Jobs To Apply

Career Area:

Technology, Digital and Data

Job Description:

Your Work Shapes the World at Caterpillar Inc.

When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.

The Cat® Digital group is the digital and technology arm of Caterpillar Inc., responsible for bringing world class capabilities to our products and services. With over one million connected assets worldwide, we're focused on using data, advanced analytics, and AI capabilities to help our customers build a better world. To accomplish this, we’re deploying analytics that generate insights, recommend optimized decisions, and improve products by intelligently integrating massive quantities of telematics information, transactional records, images, unstructured documents, and other data sources.


Job Summary:
The Condition Monitoring Analytics team of Cat Digital is seeking a Lead Data Scientist to be a technical expert, working in a team environment, to drive development & integration of digital twins & GenAI-assisted predictive analytics for condition monitoring of Caterpillar equipment.

What You Will Do:

Algorithm Development & Modeling

  • Anomaly Detection: Design and implement GPU-accelerated machine learning models (e.g., XGBoost, autoencoders, and GANs using Tesseract) to identify fault patterns in timeseries sensor data.
  • Digital Twin Engineering: Partner with engineering teams to develop onboard digital twins using NVIDIA architecture (e.g. PhysicsNeMo) to simulate, predict, and optimize the performance of heavy machinery
  • Optimization: Profile and tune deep learning algorithms for maximum efficiency on NVIDIA GPU architectures, ensuring high throughput and low latency for real-time monitoring.

Testing onboard Architecture & Integration

  • Edge Deployment: Adapt and test algorithms for onboard architecture, leveraging tools like NVIDIA Jetson for ROM generation and real-time edge processing on Cat equipment.
  • Hardware-Software Co-Design: Collaborate with hardware / simulation engineers to ensure algorithm compatibility with next-generation processors and specialized onboard compute modules.
  • Simulation-Based Training: Use high-fidelity digital twins to simulate rare failure scenarios, ensuring the GenAI assistant provides accurate troubleshooting steps for edge-case mechanical issues.

GenAI workflow & Orchestration

  • Automated Diagnostic Workflows: Develop Generative AI agents that synthesize telematics data to generate prioritized repairs for identified machine faults.
  • Unified Data Orchestration: Integrate multi-modal outputs from condition monitoring analytics & asset life history to create a machine-specific context for AI assistant.



Considerations for Top Candidates:

  • Generative AI & LLMs: Proficiency in Fine-tuning and Prompt Engineering for Large Language Models, specifically using Retrieval-Augmented Generation (RAG)
  • Condition Monitoring Algorithms: Deep understanding of Anomaly Detection, Time-Series Analysis, and Predictive Maintenance models.
  • Telematics: Experience handling high-frequency IoT sensor data, CAN bus protocols (J1939), and integrating with unified data platforms
  • Experience with High performance computing
  • Business Statistics: Extensive experience with statistical tools, processes, and practices to describe business results in measurable scales; ability to use statistical tools and processes to assist in making business decisions.
  • Analytical Thinking: Extensive knowledge of techniques and tools that promote effective analysis; ability to determine the root cause of organizational problems and create alternative solutions that resolve these problems.
  • Programming Languages: Extensive knowledge of basic concepts and capabilities of applying Python programming to solve business challenges; ability to use tools, techniques and platforms in order to write and modify programming languages.
  • Requirements Analysis: Working knowledge of tools, methods, and techniques of requirement analysis; ability to elicit, analyze and record required business functionality and non-functionality requirements to ensure the success of a system or software development project.

What You Will Have:

  • Typically, a Bachelors, Masters, or PhD degree in Applied Statistics, Data Science, Business Analytics, Predictive Analytics, Business Intelligence & Analytics, Mathematics, Computer Science, Engineering (Aerospace, Electrical, Mechanical, Computer, Industrial, Agricultural, etc.), or equivalent technical degree
  • Extensive experience applying Python (NumPy, SciPy, pandas, etc.) programming to solve business challenges.
  • Extensive experience with advanced data analysis, machine learning such as clustering, Log regressions, neural nets and statistical methods such as statistical process control, etc. (typically 8+ years)
  • Experience in practical applications of onboard architecture / software (e.g. mini projects using Raspberry Pi or any other architecture is a bonus)
  • Working experience with heavy equipment engineering or data analysis.
  • Working knowledge with cloud technologies (AWS, Azure, Google Cloud, etc.)
  • Advanced experience with version control / repositories such as GitHub
  • Experience operating in an Agile environment
  • Must demonstrate strong initiative, interpersonal skills, and the ability to communicate effectively.

About Caterpillar

Caterpillar Inc. is the world’s leading manufacturer of construction and mining equipment, off-highway diesel and natural gas engines, industrial gas turbines and diesel-electric locomotives. For nearly 100 years, we’ve been helping customers build a better, more sustainable world and are committed and contributing to a reduced-carbon future. Our innovative products and services, backed by our global dealer network, provide exceptional value that helps customers succeed.

Summary Pay Range:

$128,470.00 - $208,770.00

Compensation and benefits offered may vary depending on multiple individualized factors, job level, market location, job-related knowledge, skills, individual performance and experience. Please note that salary is only one component of total compensation at Caterpillar.

Benefits:

Subject to plan eligibility, terms, and guidelines. This is a summary list of benefits.

  • Medical, dental, and vision benefits*

  • Paid time off plan (Vacation, Holidays, Volunteer, etc.)*

  • 401(k) savings plans*

  • Health Savings Account (HSA)*

  • Flexible Spending Accounts (FSAs)*

  • Health Lifestyle Programs*

  • Employee Assistance Program*

  • Voluntary Benefits and Employee Discounts*

  • Career Development*

  • Incentive bonus*

  • Disability benefits

  • Life Insurance

  • Parental leave

  • Adoption benefits

  • Tuition Reimbursement

* These benefits also apply to part-time employees

Visa Sponsorship is not available for this position.

Posting Dates:

June 4, 2026 - June 10, 2026

Any offer of employment is conditioned upon the successful completion of a drug screen.

Caterpillar is an Equal Opportunity Employer, Including Veterans and Individuals with Disabilities. Qualified applicants of any age are encouraged to apply.

Not ready to apply? Join our Talent Community.

Back to blog

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

This is what this question poses: When do you focus and start working seriously? What are the hours you work optimally? Are you a night owl? A morning bird? Remote teams can be made up of people working on different shifts and around the world, so you won't necessarily be stuck in the 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT IS "WORKING REMOTE" REALLY FOR YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working in a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities has been to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW TO PRIORITIZE WORK?

The day I watched Marie Forleo's film separating the important from the urgent, my life changed. Not all remote jobs start fast, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology over time...