What you can learn from Rivian's AI, data strategy
Rivian has new models planned and a $5 billion partnership with Volkswagen Group that may enable the electric vehicle maker to emerge as a top software-defined-vehicle provider. The journey for Rivianâs integrated vehicle stack includes a hefty dose of artificial intelligence (AI), machine learning (ML), and cloud computing.
The Rivian journey highlights how cloud and data transformations are required to take that next AI step. The payoff? VW initially will invest $1 billion in Rivian and then invest another $4 billion over time. In return, Rivian will develop software for its own vehicles as well as VWâsâas well as its hardware designs, electrical systems, and integrated platform largely powered by Amazon Web Services (AWS) and Databricks.
As the combined Rivian-VW fleet grows, these vehicles will provide more data that can then be used to train models, add new features, and improve efficiency. VW CEO Oliver Blume said in a statement that the Rivian partnership will lower costs, improve vehicle experiences, and strengthen âour technology profile and our competitiveness.â
For Rivian CEO RJ Scaringe, the VW partnership is a validation of the companyâs strategy. Rivian has plans to launch its R2, R3, and R3X vehicles based on a new midsize platform starting in 2026. Rivian also launched its next-generation chargers and the second generation of its flagship R1S and R1T vehicles in June 2024.
While Rivian is focused on delivering new vehicles, becoming more efficient, leveraging scale, and reinventing driver experiences, there has been a lot of technology legwork behind the scenes. Hereâs a look at Rivianâs plan and the data and cloud strategy that enables it.
Rivianâs Master Plan
During the companyâs 2024 Investor Day in June, Scaringe talked about the transition to electric vehicles, building âhighly compelling products,â and leveraging vertical integration and scale as Rivian moves from its first-generation vehicles to its second with R2 and R3. Rivianâs strategy is Apple-ish in that it deeply integrates hardware, software, and experiences powered by data, ML, and AI.
âFrom the very beginning, we said that we need to own the electronics in the vehicleâthat allows us to own the network architecture, software stack, and hardware within the vehicle as it evolves,â Scaringe said. âItâs really important to control the full stack.â
Scaringe said the Rivian network includes its commercial vans as well as its consumer vehicles. The data provided by those vehicles influences how Rivianâs platform develops and simplifies everything from the operation of a Rivian vehicle to the customer experience. Rivianâs full-stack approach means it controls the camera hardware, radar systems, and data that ultimately trains AI models. âOne of our big assets is a large fleet of vehiclesâa growing fleet of vehicles that continue to get better. We train those platforms with very powerful compute and a really robust sensor set,â he said.
That data from Rivianâs fleet also enables the company to optimize everything from battery modules to electronics, drive units, and power trains. Ultimately, that optimization will lead to profitability. Rivianâs latest generation of vehicles will feature quad motors that are more efficient and save money across the company.
Rivianâs maintenance and service efforts also fuel the data flywheel. The company continues to build out its brick-and-mortar infrastructure, but most service actions are either over the air or via a mobile van.
According to Scaringe, â70% of our service actions happen with a mobile service that uses the diagnostics platform built into the vehicle and allows us to be highly predictive with what the vehicles need.â
If something is wrong in your vehicle, he continued, âwe know about it. We can make sure we have the parts for it and we can then go to you, go to your house, go to your business, and take care of whatever service actions are needed. Today our service network is ramping, but over time this becomes a very powerful part of our business as well.â
Scaringe added that the mobile service effort enhances the customer experience. Rivian is also collecting data from charging networks to enhance uptime. Rivianâs software stack features a charging score to rate charging networks based on efficacy.
Rivianâs investor meeting was designed to highlight the companyâs approach to AI and vertical integration, but also to outline its path to profitability by driving down the cost of bill of materials. In 2023, Rivian produced 57,232 vehicles and posted a net loss of $5.43 billion, an improvement compared to the 2022 net loss of $6,752 billion.
For Rivian to deliver a profit, it will have to leverage the data flywheel, AI-based driving experience, and autonomous technology it is creating (see Figure 1).
Figure 1. Rivian Continuously Updates Its Data to Train Models and Improve Customer Experiences

Source: Rivian
âWhen you think about these vehicles as being a series of models, you really want to be able to train these things end to end,â James Philbin, vice president of Autonomy and AI at Rivian, said. According to Philbin, supplier-based components donât provide the data stream that Rivian-built software and hardware can. âWe want the system to be simulated in on the cloud. The concept of vertical integration is even more important in autonomy. We want to be AI-first, using the rich, powerful models, using lots of data for training and leveraging our fleet,â he said.
The company is expecting to produce 57,000 units in 2024 and reduce its material costs by about 20% as it transitions to its Gen2 platform. Rivian expects lower material costs will bring it to positive gross profit per vehicle in the fourth quarter of 2024. The company is targeting positive adjusted earnings before interest, taxes, depreciation, and amortization (EBITDA) in 2027, and its partnership with Volkswagen reinforces the companyâs balance sheet with a $5 billion investment (see Figures 2 and 3).
Figure 2. Rivian Is Planning to Aggressively Reduce Its Costs to Drive to Profitability

Source: Rivian
Figure 3. Rivian Is Focusing Heavily on Driving Costs Down for Manufacturing Its Vehicles

Source: Rivian
80% of Rivianâs Software Is Built In-House
According to Wassym Bensaid, Rivianâs chief software officer, the company builds 80% of its software, and thereâs a good reason for that: The Rivian experience is driven by software. To drive the point home, Bensaid opened his Investor Day presentation highlighting a customer that goes to the gym three times a week, wakes up at 7 a.m., and walks to her car to find it configured to her preferred temperature and preferred music service. All she did was pick up her smartphone.
That 30-second experience would have required more than 30 suppliers, software interfaces, and integration points, Bensaid said. To change the experience would require more coordination with all of those suppliers. âWith the vertical integration we have between hardware and software, we developed a much cleaner and simpler architecture with the powerful compute we have,â he said. âWe developed an end-to-end integrated and connected platform with more than 80% of the software done in-house. We orchestrate the choreography of all these features.â
Figure 4. Rivianâs Software Platform Is Designed to Work Across Its Fleet, Including Yet-to-Be-Named Vehicles

Source: Rivian
In other words, Rivian is looking to perfect the software-defined vehicle (see Figures 4 and 5).
Bensaid outlined the following:
- Rivianâs architecture is scalable and modular.
- There are multiple abstraction layers for different hardware configurations.
- The low-level architectures run on the same binary code across the Rivian fleet.
- Rivianâs core software foundation can create multiple variants. That software architecture was one of the big reasons Volkswagen partnered with Rivian.
- AI and models are pervasive across the company, including manufacturing, supply chain, commercial, and direct-to-consumer experiences.
- Sensor data is fed into Rivian Cloud, with vehicle features updated over the air.
âThe product today is very different than the product our customers bought two years ago,â Bensaid said, âand the product in two years will also be very different and much improved versus where we are today.â
This evolution is possible due to a connected data platform built on Databricks and AWS. âThe connected data platform is at the heart of our entire software stack, and itâs as important as the embedded software,â Bensaid said. âWe can run smart workloads in the cloud. In some cases, once they mature through machine learning, we can move them to the edge. Weâre building this data foundation with security and privacy in mind, and with the anonymized data, we can continue to influence the product over time. With the collaboration with the Volkswagen Group, we will have much more scale, and this data platform will become even more important.â
Ultimately, Rivian has built a vehicle operating system thatâs embedded when a car moves through the assembly line. Rivian creates a âclean sheet diagnostic solutionâ that ensures reliability and safety and enables servicing later. This system also can find issues on the line so Rivian can fix them before a vehicle leaves the factory. âInstead of having a defect escape and go to the end of the lineâwhich would take more time and money to disassemble the vehicleâweâre able to detect that as the vehicle is moving through the line. Itâs a massive efficiency unlock for us, and itâs a higher-quality bar for our vehicles,â Bensaid said.
Figure 5. Rivianâs Cloud Infrastructure Has Sped Up the Companyâs Software Release Cycle

Source: Rivian
The software-defined vehicle approach also impacts the customer experience, Bensaid explained. With telemetry and remote diagnostics, service technicians donât need to physically inspect vehicles. If over-the-air updates donât work, Rivian can deploy a mobile service truck.
âThis is a massive efficiency saving when you compare that with the legacy auto model where, for any issue that you have, you need to bring the car back to the dealership. And with the data platform that we have and integration of AI, we can now unlock many more capabilities,â Bensaid said (see Figures 6 and 7).
Figure 6. According to Rivian, the Legacy Model in Auto Manufacturing Doesnât Scale Well for Software-Defined Vehicles

Source: Rivian
Figure 7. Rivianâs Model Is to Vertically Integrate the Entire Value Chain for Simplicity

Source: Rivian
Rivianâs ability to create a software-defined vehicle relies on integrated hardware and a bevy of sensors and vehicle compute. Vidya Rajagopalan, vice president of Engineering and Hardware at Rivian, said Rivian has decided to build its own hardware purpose-built for the companyâs vehicles, with every model today and going forward built on the same scalable hardware stack. âWe use the exact same hardware platform on all of them. The reason we do that is we donât want unwanted features. Weâre not buying something off the shelf that has features for a broad audience,â she said.
Rajagopalan added that Rivian is also building its own hardware stack to eliminate margin stacking from suppliers. âWe were very clear that we were not going to follow that legacy philosophy of hardware for really small pieces and small tasks,â Rajagopalan said. âNot only does that approach add cost, but it also creates huge complexity in software.â
According to Rajagopalan, Rivian is using one electronic control unit (ECU) per broad function category (see Figure 8). Typically, ECUs are used for a particular function such as thermal management, body controls, wipers, door handles, and other functions. Rivian has reduced the number of ECUs from 17 to 7.
Figure 8. Rivian Has Dramatically Simplified Its Electronic Control Unit Systems

Source: Rivian
In autonomy, Rivian has merged four different ECUs with one using Nvidia Drive and upgraded multiple sensors with high-speed interfaces. âWe wanted to put the best sensing out there because we know in the AI world itâs really important that you get data for training. We also donât want to keep upgrading sensors, because then your data becomes stale,â Rajagopalan said.
Going forward, Rajagopalan said, Rivian will continue to consolidate and optimize ECUs to lower costs and gain supply chain leverage that will only be improved with the VW partnership.
AWS and Databricks
At the Databricks AI Summit in 2023, Rivianâs Bensaid outlined the companyâs data strategy.
Bensaid said Rivian initially struggled with data silos and multiple systems to the point that the bottleneck became the companyâs data strategy.
As a result, Rivian used Databricksâ Lakehouse platform to build a new architecture on top of its data lakes. Databricksâ Unity Catalog is used to create one version of the truth.
Databricks is then connected to multiple AWS services to round out Rivianâs cloud stack. At AWS re:Invent 2023, Rivianâs Anirban Kundu, principal engineer, and Rupesh More, staff engineer, walked through the AWS stack and how it connects to Databricks.
âOne use case at Rivian is the digital twin. The intent is to model the vehicleâs behavior on the cloud, using its vehicle data,â Kundu said (see Figure 9). âAt any point in time there are hundreds of algorithms at play in the vehicle.â
Figure 9. Rivian Creates Digital Twins of Its Vehicles

Source: Rivian
Vehicle data ultimately lands in the cloud, where it is parsed, partitioned, anonymized, sanitized, and put into standard datasets. Kundu said Rivian takes in several petabytes of vehicle data through its ingestion pipelines and analytics via AWS services (see Figure 10).
âWe donât see the vehicles as independent entities. Theyâre extensions of what you can do via the cloud,â Kundu said.
Figure 10. A Look at How Rivian Leverages AWS

Source: Rivian
Rivianâs More noted that the raw data from AWS is streamed into Databricks architecture to create clean datasets that are then transformed into custom tables used for analytics. Thereâs also an event-watch architecture, which is used to surface anomalies in cabin temperature and critical signals. These events are processed using AWS services to send notifications (see Figure 11).
Kundu said the AWS and Databricks architecture must scale going forward, given that Rivian expects to have about 1 million vehicles in the fleet in the next few years as well as more data, more applications, and more use cases.
Figure 11. Rivian Blends Its AWS and Databricks Infrastructure

Source: Rivian
Autonomy, AI, and Beyond
According to Philbin, Rivian has created a âperception stackâ thatâs fed data from 11 cameras with 55 megapixels of real-time imaging, five radar systems with 1,000 feet of forward-facing detection range, and overlapping sensors for redundancy.
âThe perception and prediction stack is really the core of the new platform,â Philbin said, who previously worked at Waymo. âThis is a large, deep machine learning model that takes all of the sensors together in combination and generates this world model output.â
Radar system and camera data is fed into a neural network that uses a transformer network to translate the information into real-world models of objects, maps, predictions, and ultimately behaviors. Rivianâs transformers operate similarly to the way large language models work.
Rivian will be able to leverage these systems to make predictions about driving behaviors, Philbin explained (see Figure 12). âThe predictions are how we think agents are going to react in the near future,â he said. âWe want to add to this same model so it can learn driving behaviors from real data for planning.â
The key is to learn from human driving behaviors and incorporate the rules of the road. âWe donât want our autonomy stack to learn bad behaviors,â Philbin said.
Figure 12. Rivianâs Cloud and OS Are Designed to Create a Seamless Flow of Data

Source: Rivian
Philbin also said that Rivian has added abstraction layers to account for APIs in different chips and various sensors to ensure independence from any one compute platform or provider. âWe have that independence from any particular compute platform. And that extensibility allows us to cover a wide range of vehicle platforms and more in the future,â he said.
Add it up, and Rivianâs bet is that its data loop will drive autonomy features that will differentiate the company in the future. âAutonomy is a key competitive and street-strategic advantage for Rivian,â Philbin said. âWe think that as the systems get better and better and better over time, customers are expecting more and more from the autonomous systems available in their vehicles.â
What You Can Learn From Rivian
- Rivianâs AI strategy and future autonomy platform builds on a cloud and data strategy that can scale. Without those investments, Rivian would look more like a legacy automaker.
- Customer experience is everything. Rivianâs integrated stack approach begins and ends with the customer experience.
- You canât have good customer experience, improve margins, and leverage your unique advantages without a data flywheel.
- Given electric vehicles are a relatively new category, the technology is evolving, and Rivian must build its own platform. Many enterprises are realizing that theyâre going to need to build applications versus buying off the shelf, where suppliers are more concerned about their margins than their customersâ.
Get this story as a PDF.