From Complementary Sensors to Fused Intelligence
Lidar-camera sensor fusion combines 3D point cloud depth data from lidar with high-resolution visual information from cameras to produce richer, more robust environmental representations than either sensor can achieve alone. Lidar supplies accurate metric depth, 3D geometry, and range data that is largely insensitive to lighting conditions, while cameras provide the texture, colour, semantic content, and classification information that lidar cannot produce independently. Together, the two modalities address each other’s fundamental limitations — a pairing that has become a critical enabler for autonomous vehicles, mobile robotics, and industrial safety systems.
The fusion pipeline is typically divided into three processing stages. Data-level (early) fusion integrates raw sensor streams before any feature extraction; feature-level (mid) fusion combines intermediate neural representations; and decision/object-level (late) fusion merges independently produced detections. Each stage involves fundamental tradeoffs between computational cost, modality complementarity, and real-time performance. According to WIPO, multi-modal sensor integration is among the fastest-growing technology areas in autonomous systems patent filings globally.
Early fusion integrates raw sensor data before feature extraction. Mid-level fusion combines intermediate neural representations from each sensor. Late fusion merges independently produced object-level detections from each modality. The choice of stage determines computational cost, latency, and how well the model exploits cross-modal complementarity.
This analysis covers 70+ patent records spanning 2014 to 2026, drawn from 12 jurisdictions and at least 30 distinct assignees. Core sub-domains include hardware-synchronized lidar-camera integration, extrinsic calibration and dynamic alignment, deep neural network-based feature fusion, and multi-modal object detection and tracking pipelines. The dataset spans foundational Bayesian approaches filed by General Motors in 2014 through to NeRF-based temporal alignment filed by Qualcomm in 2025.
Lidar-camera sensor fusion patent analysis covering 70+ records from 2014–2026 spans 12 jurisdictions and at least 30 distinct assignees, with China (CN) identified as the single largest jurisdiction by filing count, representing records from at least 20 distinct Chinese assignees.
How the Innovation Timeline Evolved: 2014–2026
The lidar-camera sensor fusion patent landscape divides into three distinct phases, each characterised by a dominant technical paradigm: statistical foundations, deep learning integration, and calibration intensification.
The foundational phase (2014–2019) established the mathematical and architectural frameworks for multi-sensor data combination. General Motors Global Technology Operations filed two Chinese patents in 2014 covering Bayesian network fusion of multiple lidar scan returns for object tracking — establishing the statistical foundations that subsequent work builds upon. FedEx Corporate Services’ multi-sensor collision avoidance system (CA, 2019) signalled early industrial adoption beyond the automotive sector.
The deep learning integration phase (2020–2022) is characterised by the rapid adoption of convolutional neural networks and transformer architectures. Mobileye Vision Technologies’ multi-family vehicle navigation patents (JP, 2020–2024) anchored image-lidar attribution as a core navigation primitive. Waymo LLC filed its rolling-shutter camera synchronisation system across multiple jurisdictions (IL, 2021–2022), demonstrating production-grade hardware co-design. StradVision filed its multi-stage convolutional fusion approach (EP, 2020; IN, 2024), integrating camera images and point cloud maps at each convolution stage for HD map updates.
The maturation and calibration intensification phase (2023–2026) reflects convergence around automated calibration and neural-guided fusion. Motional AD LLC filed camera-to-lidar calibration and validation pipelines in KR and DE (2021–2024). GM Global Technology Operations LLC filed a dynamic in-vehicle alignment method (US/CN, 2025) and a robust alignment approach (CN, 2025). Qualcomm filed a lidar-camera spatio-temporal alignment method using Neural Radiance Fields (CN, 2025), signalling a new paradigm for temporal synchronisation. The most recent active filings from 2025–2026 from Shenzhen Yinwang Intelligent Technologies, Beijing Sankuai Online Technology, and Huawei Technologies reflect continued commercial industrialisation.
“The emergence of LiDAR-guided label generation for camera-radar models signals a strategic repositioning of lidar from a mandatory runtime sensor to a high-cost development tool — with broad licensing implications.”
Four Technical Clusters Defining the Patent Landscape
Patent activity in lidar-camera sensor fusion organises into four technically distinct clusters, each addressing a different layer of the fusion stack — from hardware co-design through to object-level tracking pipelines.
Cluster 1: Hardware-Level Synchronisation and Co-Design
This cluster addresses the physical challenge of temporal and spatial alignment between rotating lidar systems and rolling-shutter cameras, which operate on fundamentally different timing models. Waymo LLC’s Synchronized Spinning LIDAR and Rolling Shutter Camera System (IL, 2022) aligns rows of camera sensing elements with the lidar’s rotation axis; a controller sequences camera exposure rows to match the lidar’s angular scan position, achieving pixel-level temporal co-registration. A companion Waymo patent (CN, 2024) describes a single rotating platform that co-mounts the lidar transmitter, lidar receiver, and image sensor, eliminating parallax and enabling direct pixel-to-point mapping without stereo-correction computation overhead. Beijing Sankuai Online Technology’s rotation-triggered image acquisition system (US, 2023) uses the lidar’s real-time rotation angle to trigger image capture from the camera covering the same angular interval.
Waymo’s co-mounted rotating platform patent (CN, 2024) eliminates parallax between lidar and camera by mounting both sensors on a single rotating platform, enabling direct pixel-to-point mapping without stereo-correction computation overhead — a significant reduction in processing complexity for production autonomous driving systems.
Cluster 2: Extrinsic Calibration and Dynamic Alignment
Accurate 6-degree-of-freedom spatial transformation between the lidar and camera coordinate frames is the prerequisite for any fusion operation. Motional AD LLC’s automated pipeline (KR, 2024) combines 3D plane, vector, and point correspondences from a single camera image and lidar scan to estimate and validate extrinsic rotation and translation parameters — a “single-shot” approach that eliminates the need for calibration targets. A companion Motional AD filing (CN, 2025) uses a transformer-based backbone with self-attention to infer 6-DoF relative transformation, enabling online real-time recalibration that adapts to vibration, weather, and mechanical changes. Huawei Technologies (KR, 2024) takes a generative approach: a neural network ingests camera images to produce a simulated lidar point cloud, which is then matched against the real lidar scan to calibrate spatial alignment.
Map the full calibration patent landscape with PatSnap Eureka’s AI-powered search across 170M+ patent documents.
Explore Patent Data in PatSnap Eureka →Cluster 3: Deep Learning Feature-Level and Early Fusion
This cluster covers methods that integrate raw or intermediate representations from both sensors within a neural network, enabling the model to learn joint feature spaces. StradVision’s learning method (EP, 2020; IN, 2024) fuses camera feature maps and corresponding point cloud maps via an integration layer at each convolutional stage, producing hierarchically enriched fused feature maps for object detection and segmentation with distance estimation. Beijing Qingzhouzhihang’s asynchronous fusion method (US, 2022) explicitly decouples the two sensor streams: current-frame lidar 3D embeddings are fused with a previous-frame hidden state via temporal processing, generating a temporary hidden state that is then combined with the current camera frame to reduce processing latency. NVIDIA Corporation’s multi-view deep neural networks (JP, 2024) process 3D environment data through chained constituent networks operating in perspective and bird’s-eye views, outputting 2D/3D bounding boxes for use in autonomous vehicle planning stacks.
Cluster 4: Object-Level (Late) Fusion and Multi-Target Tracking
Late fusion pipelines allow lidar and camera to independently produce detections that are subsequently combined via association algorithms, Kalman filtering, and bounding-box IoU matching. Tata Consultancy Services’ fusion-based object tracker (US, 2023) independently processes 3D lidar point clouds, 2D camera images, and bird’s-eye view projections; a panoptic segmentation output identifies non-occluded object regions, eliminating spurious lidar points and producing accurate 3D positions for seamless 360° tracking. Hyundai Motor Company’s object tracking method (US, 2025) creates lidar tracks and non-final sensor fusion tracks from camera/radar data, then assesses heading angle consistency to determine the final heading angle of the fusion track — directly addressing angular estimation instability. Ford Global Technologies’ camera-lidar fusion object detection system (CN, 2023) prunes the lidar dataset by computing the probability distribution of detection assignments for each point, resolving occlusion ambiguities for closely spaced or partially occluded objects. Standards bodies such as IEEE have published reference architectures for sensor fusion pipelines that align with these late-fusion approaches.
Among lidar-camera sensor fusion patent filers analysed from 2014–2026, Motional AD LLC holds the largest multi-jurisdiction presence with 7+ filings across US, DE, KR, and CN, focused on calibration and information merging. Waymo LLC and Tata Consultancy Services Limited each hold 5+ filings in this dataset.
Geographic and Assignee Concentration
China is the single largest jurisdiction by filing count in this dataset, with records from at least 20 distinct Chinese assignees — spanning major automotive OEMs, tier-1 technology companies (Baidu, Huawei, Qualcomm China filings), autonomous driving startups, and universities including Beijing Institute of Technology, Tongji University, Chongqing University, the Chinese Academy of Sciences, and others. This volume reflects both domestic innovation and international players filing patent protection in the Chinese market.
Japan is the second most represented jurisdiction, primarily through PCT national phase entries from Waymo, Mobileye, Tata Consultancy Services, Motional AD, Baidu Times Technology, and FedEx — indicating that foreign companies view Japan as a critical protection market. Korea features filings from Hyundai Motor Company, Motional AD, Qualcomm Korea, LG Electronics, Ouster, Korea Institute of Construction Technology, and Yonsei University, indicating a domestic automotive and robotics ecosystem with significant external IP. The United States includes cross-Pacific filers such as Tata Consultancy Services, GM Global Technology Operations, Motional AD, GM Cruise Holdings, Beijing Qingzhouzhihang, and Beijing Sankuai Online Technology. Germany contains filings from Infineon Technologies AG (lidar hardware), Motional AD, and Robert Bosch (Licam integrated sensor unit).
Innovation is moderately concentrated among a small set of autonomous driving platform companies — Waymo, Motional, Mobileye — and Chinese technology companies for deployment applications, with significant academic filing activity concentrated in China. This pattern mirrors broader autonomous systems IP trends tracked by EPO in its annual patent index, which identifies autonomous mobility as one of the top five technology areas by filing growth rate.
In the lidar-camera sensor fusion patent dataset covering 2014–2026, China (CN) is the single largest jurisdiction by filing count with at least 20 distinct Chinese assignees, followed by Japan (JP) as the second most represented jurisdiction primarily through PCT national phase entries from foreign companies including Waymo, Mobileye, Tata Consultancy Services, and FedEx.
The FedEx multi-jurisdiction filing strategy (CA 2019; JP 2021, 2021; CN 2024) covering lidar-camera fusion for collision avoidance in airport ground support vehicles represents a mature industrial deployment paradigm outside automotive. The Korea Institute of Construction Technology’s infrastructure monitoring system (KR, 2023) — designed to fuse incident data for dissemination to public agencies and autonomous driving vehicles — points to an emerging non-automotive vertical. As noted by OECD in its innovation outlook reports, industrial logistics and smart infrastructure represent undertapped application domains relative to the volume of automotive-focused IP.
Five Emerging Directions Reshaping Fusion Architecture
Among the most recent filings (2024–2026) in this dataset, five distinct forward-looking directions are visible, each with implications for IP strategy and R&D investment prioritisation.
1. NeRF-Based Spatio-Temporal Alignment. Qualcomm’s LiDAR-Camera Spatio-Temporal Alignment Using Neural Radiance Fields (CN, 2025) uses NeRF models and kinematic information to synthesise temporally synchronised camera images matched to lidar point clouds — bypassing hardware-level synchronisation constraints and enabling training data augmentation. This approach decouples the synchronisation problem from hardware design, potentially making it applicable across heterogeneous sensor configurations.
2. Transformer-Based Online Calibration. Motional AD’s transformer-backbone calibration model (CN, 2025) and Huawei’s neural network spatial calibration via simulated point clouds (KR, 2024) represent a shift from geometric feature-based offline calibration toward learned, continuous online recalibration that can handle real-world sensor drift caused by vibration, weather, and mechanical changes.
3. Trajectory-Based Cross-Modal Correspondence for Calibration-Free Fusion. Huawei Technologies’ Method for Multi-Modal Sensor Fusion Using Object Trajectories for Cross-Domain Correspondence (EP, 2025) computes correspondence between sensor observations by matching spatiotemporal object trajectories across domains using a distance metric — potentially enabling fusion without explicit extrinsic calibration.
4. Dynamic Vision Sensor (Event Camera) and LiDAR Fusion. Shenzhen Jiutian Ruixin Technology’s Target Recognition Method Based on Dynamic Vision Sensor and LiDAR Fusion (CN, 2024) integrates event-camera (neuromorphic sensor) data with lidar using a compute-in-memory CNN architecture, targeting high-speed, low-latency target recognition with reduced power consumption.
5. LiDAR as a Training Oracle for Camera-Radar Fusion Models. GM Cruise Holdings’ Systems and Techniques for Using LiDAR Guided Labels to Train a Camera-Radar Fusion Machine Learning Model (US, 2025) uses lidar detections as ground truth to supervise camera-radar fusion model training — positioning lidar as a labelling oracle rather than an online sensor, enabling cost-optimised deployment with camera-radar-only inference at scale.
Track NeRF-based fusion and transformer calibration patents as they publish — set up AI-powered alerts in PatSnap Eureka.
Monitor Emerging Patents in PatSnap Eureka →Strategic Implications for IP Teams
The lidar-camera sensor fusion patent landscape presents four specific strategic considerations for IP and R&D teams assessing freedom-to-operate, white-space opportunities, and licensing exposure.
Calibration is a core IP battleground. A disproportionate share of recent filings across US, CN, KR, and DE jurisdictions target extrinsic calibration and dynamic alignment. R&D teams entering this space should expect dense prior art from Motional AD, GM, Waymo, and Mobileye, and should consider differentiation via online/continuous calibration without fiducial targets or known environments.
Deep learning fusion is fragmenting into specialised architectures. Early homogeneous CNN fusion approaches are giving way to transformer-based, NeRF-based, and trajectory-domain fusion — each requiring different training infrastructure and offering different latency profiles. IP strategists should monitor which architecture gains traction in production autonomous driving stacks, as foundational patents in the winning architecture will carry high licensing value.
China represents both the largest filing volume and the most competitive market. With 20+ Chinese assignees active in this space — including national universities, automotive OEMs, and hardware companies — any company seeking CN protection needs a comprehensive freedom-to-operate analysis before product launch. The PatSnap IP management platform provides jurisdiction-level prior art mapping across all major filing offices.
Industrial logistics and infrastructure are undertapped compared to automotive. FedEx’s multi-jurisdiction filings in collision avoidance and the Korea Institute of Construction Technology’s infrastructure monitoring work suggest that non-automotive verticals remain relatively open for IP development, particularly in industrial vehicles, construction equipment, and smart infrastructure.
LiDAR as a training oracle, not just an online sensor. The emergence of lidar-guided label generation for camera-radar models (GM Cruise, 2025) signals a strategic repositioning of lidar from a mandatory runtime sensor to a high-cost development tool. Companies that own this training methodology will be able to license it broadly as camera-radar-only systems proliferate in cost-sensitive markets. The PatSnap Insights blog covers related autonomous systems IP developments as they emerge.
GM Cruise Holdings filed a 2025 US patent titled “Systems and Techniques for Using LiDAR Guided Labels to Train a Camera-Radar Fusion Machine Learning Model,” which uses lidar detections as ground truth labels to supervise camera-radar fusion model training — positioning lidar as a labelling oracle for cost-optimised camera-radar-only deployment rather than a mandatory runtime sensor.