Book a demo

Patent Drafting Analysis of Tesla’s Vision-Only Object Distance Estimation via ML | US 10,956,755 B2

Patent Drafting Analysis of Tesla’s Vision-Only Object Distance Estimation via ML | US 10,956,755 B2
IP Drafting Analysis · US 10,956,755 B2

Patent Drafting Analysis of Tesla's Vision-Only Object Distance Estimation Using Machine Learning | US 10,956,755 B2

A structural and strategic analysis of Tesla's camera-only distance prediction patent, examining claim architecture, drafting quality, §101 exposure, critical gaps, and prosecution positioning across all 23 claims.

US 10,956,755 B2Filed: Feb. 19, 2019Granted: Mar. 23, 2021G06K 9/00G06T 7/70G06N 20/00
Spec Words
8,200
Across 5 sections
Draft now ↗
Total Claims
23
4 independent · 19 dependent
Draft now ↗
Figure Sheets
6
System block diagram, training/inference flow diagrams, sensor capture, object detection
Draft now ↗
Published by PatSnap Insights Team · · 12 min read Verified by PatSnap Eureka Data
Overview

Structural Overview

The detailed description dominates at approximately 79% of total words (~6,500 of ~8,200), providing extensive embodiment coverage across system components, training data generation, model deployment, and inference workflows. The claim set comprises 23 claims across 4 independent claims — one system (Claim 1), one CRM (Claim 17), and two method claims (Claims 19 and 23) — with 19 dependent claims yielding a 4.75:1 dependency ratio. Six drawing sheets cover system architecture, training data flow, inference flow, multi-sensor capture, and object detection output, with FIG. 6 directly supporting the core claim limitation of outputting distance from image-only input.

Section Word Distribution

Detailed Desc. 6500 w Claims 800 w Summary 290 w Background 230 w Brief Desc. 170 w Abstract 70 w ↗ Click bars to explore

Figure Inventory — 6 Sheets

FigureDescriptionRole
FIG. 1
Block diagram of the deep learning system 100 for autonomous driving, showing vision sensors 101, additional sensors 103, image pre-processor 105, deep learning network 107, AI processor 109, vehicle control module 111, and network interface 113.Search in Eureka ↗
System architecture
FIG. 2
Flow diagram illustrating the 5-step training data creation process: receive vision data (201), receive related data (203), identify objects (205), determine ground truth (207), and package training data (209).Search in Eureka ↗
Flow diagram
FIG. 3
Flow diagram showing the end-to-end ML lifecycle: prepare training data (301), train model (303), deploy model (305), receive sensor data (307), apply trained model (309), and control vehicle (311).Search in Eureka ↗
Flow diagram
FIG. 4
Flow diagram illustrating autonomous driving inference pipeline: receive sensor data (401), pre-process (403), initiate deep learning analysis (405), provide results to vehicle control (407), control vehicle (409), and transmit sensor/related data (411).Search in Eureka ↗
Flow diagram
FIG. 5
Overhead diagram of autonomous vehicle 501 with forward sensor 503 and right-side sensor 553, showing fields of view 509/559, distance vectors 513/523/563, and bounding boxes around neighboring vehicles 511, 521, and 561 used to generate training ground truths.Search in Eureka ↗
Key embodiment
FIG. 6
Analyzed vision data 601 from a forward-facing camera showing detected lane lines 603–609 and bounding boxes 611–619 for multiple vehicles, illustrating the inference-time output of the trained machine learning model predicting object distances from image data only.Search in Eureka ↗
Claim support
Analysis powered by PatSnap Eureka. Patent text and figures publicly available from USPTO. Draft a Similar Patent
Claims

Claim Architecture Analysis

The patent contains 4 independent claims: Claim 1 (system/apparatus), Claim 17 (CRM/computer program product), Claim 19 (method — training pipeline), and Claim 23 (method — on-vehicle inference), providing tripartite enforcement coverage. The 19 dependent claims against 4 independent claims yields a 4.75:1 ratio, which is slightly below the software/AI industry norm of 5–8:1, leaving some dependent claim real estate unused. The structural split between a training-focused method claim (Claim 19) and an inference-focused method claim (Claim 23) is a strategically sound choice, but both method claims share nearly identical preambles with the system claim, offering limited differentiation.

Core inventive concept: The claims address the high cost and complexity of emitting distance sensors (radar, lidar, ultrasonic) in autonomous vehicles by training a machine learning model using correlated camera images and emitting distance sensor outputs — then deploying a model that outputs object distance "using only the image data" (Claim 1) or "without using distance information from a first emitting distance sensor" (Claim 23), eliminating the need for dedicated distance sensors at inference time.

Independent Claim Dissection

ClaimPreambleTransitionKey Body Elements
Claim 1A system, comprising: one or more processors configured to…; and a memory coupled to the one or more processorscomprising
receive image data from camera of vehicle depicting object in surrounding environment; utilize image data as input to trained ML model outputting distance from vehicle to object using only image data; wherein trained ML model was trained using training image and correlated output of emitting distance sensor; memory coupled to processorsSearch prior art ↗
Claim 17A computer program product, the computer program product being embodied in a non-transitory computer readable storage medium and comprising computer instructions for:comprising
receiving image data from camera of vehicle depicting object in surrounding environment; utilizing image data as input to trained ML model outputting distance using only image data; wherein trained ML model was trained using training image and correlated output of emitting distance sensorSearch prior art ↗
Claim 19A method, comprising:comprising
receiving selected image from camera of vehicle; receiving distance data from emitting distance sensor; identifying object using selected image as input to trained ML model; extracting distance estimate from received distance data; creating training image by annotating selected image with extracted distance estimate; training second ML model to predict distance measurement using training data set including training image; providing trained second ML model to second vehicle with second cameraSearch prior art ↗
Claim 23A method implemented by a processor included in a vehicle, the method comprising:comprising
receiving image data from camera of vehicle depicting object in surrounding environment; utilizing image data as input to trained ML model outputting distance without using distance information from first emitting distance sensor; wherein trained ML model was trained using training image and correlated output of second emitting distance sensorSearch prior art ↗

Claim Dependency Tree

1 System: processors receive camera image data, use trained ML model to output object distance using only image data; model trained with emitting distance sensor ground truthSearch Claim 1 prior art ↗
2 Adds: trained ML model also outputs direction of object with respect to vehicleSearch in Eureka ↗
3 Adds: processors further configured to identify velocity vector of object with respect to vehicleSearch in Eureka ↗
4 Adds: emitting distance sensor is a radar sensorSearch in Eureka ↗
5 Adds: emitting distance sensor is an ultrasonic sensorSearch in Eureka ↗
6 Adds: emitting distance sensor is a lidar sensorSearch in Eureka ↗
7 Adds: object is one of pedestrian, vehicle, obstacle, barrier, or traffic control objectSearch in Eureka ↗
8 Adds: training image and correlated emitting distance sensor output captured on a training vehicleSearch in Eureka ↗
9 Further: emitting distance sensor output includes estimated distance and direction of second object identified in training image from training vehicleSearch in Eureka ↗
10 Further: training image is one of a time series of images captured using image sensor of training vehicleSearch in Eureka ↗
11 Further: second object identified at least in part by analyzing plurality of images in time seriesSearch in Eureka ↗
12 Further: time series of images used to disambiguate emitting sensor output for second object among plurality of detected objectsSearch in Eureka ↗
13 Adds: identified distance of object used to predict a semantic label associated with the objectSearch in Eureka ↗
14 Adds: trained ML model was trained to predict distance output of emitting distance sensor based on input imageSearch in Eureka ↗
15 Adds: ML model distance output combined with output of installed emitting distance sensor to determine relative location of objectSearch in Eureka ↗
16 Adds: trained ML model outputs plurality of distances corresponding to points on bounding box associated with the object as depicted in image dataSearch in Eureka ↗
17 CRM: non-transitory storage medium with instructions to receive camera image data and utilize as input to trained ML model outputting distance using only image data, trained with emitting distance sensor ground truthSearch Claim 17 prior art ↗
18 Adds: trained ML model outputs plurality of distances corresponding to points on bounding box of object as depicted in image dataSearch in Eureka ↗
19 Method (training pipeline): receive selected image and emitting distance sensor data, identify object via ML model, extract distance estimate, annotate training image, train second ML model, provide to second vehicleSearch Claim 19 prior art ↗
20 Adds: captured image is part of time series of images; distance data is part of time series of captured distance dataSearch in Eureka ↗
21 Further: tracking identified object across time series; extracted distance estimate based on correlating tracked object to received distance dataSearch in Eureka ↗
22 Adds: identified object is a detected vehicle or pedestrianSearch in Eureka ↗
23 Method (on-vehicle inference): processor in vehicle receives camera image data, utilizes input to trained ML model outputting distance without using distance from first emitting distance sensor; model trained using training image and correlated output of second emitting distance sensorSearch Claim 23 prior art ↗
MetricThis ApplicationSoftware / AI / Autonomous Vehicle Norm
Total claims2320 – 30
Independent claim count43 – 5
Dependent : Independent ratio4.75 : 15 – 8 : 1
Method claims present?Yes — Claims 19, 23Common
System / apparatus claims?Yes — Claim 1Common
Analysis powered by PatSnap Eureka. Patent text and figures publicly available from USPTO. Draft a Similar Patent
Drafting Quality

Drafting Quality Signals

The patent demonstrates strong structural coverage — four independent claim types (system, CRM, training method, inference method) provide broad enforcement reach, and the key 'using only the image data' limitation in Claims 1, 17, and 23 is directly supported by FIG. 6 and the detailed description. However, the dependent claims are heavily clustered on Claim 1 (Claims 2–16), leaving Claims 17 and 23 with only one dependent claim each, which creates shallow fallback positions for the CRM and on-vehicle inference branches.

Antecedent Basis
Antecedent basis is well-maintained throughout the claim set. The phrase "the trained machine learning model" in Claim 1's body is properly introduced in the same claim's limitations. In Claim 9's "the second object" is introduced by "a second object" in the same claim. In Claim 19, "the identified object" is properly introduced by "identifying an object" and "the received distance data" tracks back to "receiving distance data." No antecedent basis violations were identified across all 23 claims.
Spec–Claim Consistency
The core limitation in Claims 1, 17, and 23 — that the trained ML model outputs distance "using only the image data" — maps directly to FIG. 6 (analyzed vision data 601 showing bounding boxes 611–619 with predicted properties) and the detailed description col. 3 ll. 1–5 and col. 13 ll. 1–10. The training pipeline limitations of Claim 19 (annotating image with distance estimate, training second model) are supported by FIG. 2 steps 201–209 and col. 8–12. The "time series" limitations of Claims 10–12, 20–21 are supported by col. 9 ll. 1–55. All independent claim limitations map to specific figures and paragraphs.
Transition Word Usage
All four independent claims use "comprising," the broadest available transition word, which is strategically appropriate for a software/AI patent where competitors may implement the core concept with additional components. The open-ended "comprising" in Claim 1 allows infringement even when an accused system also includes a dedicated distance sensor at deployment, which is commercially significant given Tesla's own fleet use of radar alongside cameras. No "consisting of" language appears, avoiding unnecessary narrowing.
§112(f) Means-Plus-Function Risk
No "means for" or "step for" language appears in any of the 23 claims. Functional language such as "configured to receive," "configured to output," and "configured to identify" in Claim 1 is paired with structural recitation of "one or more processors" and "a memory coupled to the one or more processors," satisfying the structural disclosure requirement and avoiding §112(f) treatment. The specification provides extensive structural support for the processor/memory components through FIG. 1 (AI processor 109, deep learning network 107), further insulating against §112(f) challenges.
⚠️
§101 Eligibility Risk
Claims 1 and 17 carry moderate Alice exposure because the core concept — using a neural network to predict distances from images — could be characterized as an abstract idea (mathematical model predicting sensor output) applied to a general processor/storage medium. The §101 defense relies on the hardware tie-in: "one or more processors" in Claim 1 and "a camera of a vehicle" limiting the input source, plus the vehicular control application context. However, the CRM claim (Claim 17) lacks explicit vehicle control output limitations, which weakens its concrete-application anchor relative to Claim 1. Claims 19 and 23, which recite a specific training pipeline and a processor-in-vehicle respectively, have stronger eligibility posture due to their concrete operational steps.
⚠️
Dependent Claim Fallback Quality
Nineteen of the 19 dependent claims depend from Claim 1 or Claim 9/10, leaving Claims 17 and 23 each with only one dependent claim (Claims 18 and none respectively — actually Claim 18 depends from 17, and Claim 23 has no dependents at all). This means if Claim 1 is invalidated, the CRM coverage of Claim 17 has only one fallback (Claim 18's bounding box limitation), and Claim 23 has zero fallback positions. Among Claims 2–16, substantively distinct fallbacks include Claim 13 (semantic label prediction), Claim 15 (combination with installed sensor), and Claim 16 (bounding box distances), but Claims 4–6 (sensor type specificity) add only minor differentiation against a determined invalidity challenger.
⚠️
Abstract Quality
The abstract reads: "The trained machine learning model has been trained using a training image and a correlated output of an emitting distance sensor" — which correctly identifies the novel training mechanism. However, the abstract focuses on the system claim framing only and does not mention the key differentiating limitation that the model operates "using only the image data" at inference time, which is the commercially most significant aspect of the invention (eliminating the dedicated distance sensor at deployment). An examiner reading only the abstract could easily conflate this with standard sensor fusion approaches, potentially triggering broader prior art searches that miss the point of novelty.
Figure Support Quality
Figure support is comprehensive for the independent claim limitations. FIG. 1 supports the processor/memory/camera system architecture of Claim 1. FIG. 5 directly supports the training data collection mechanism (emitting distance sensor correlated with training image) recited in Claims 1, 17, and 19. FIG. 6 supports the inference-time output of distance from image data only. The bounding box limitations of Claims 16 and 18 are supported by FIG. 6 (bounding boxes 611–619) and the detailed description at col. 15 ll. 20–30. The time series limitations of Claims 10–12 and 20–21 are textually supported but lack a dedicated figure showing the temporal tracking process, which is a minor gap.
Analysis powered by PatSnap Eureka. Patent text and figures publicly available from USPTO. Draft a Similar Patent
Scorecard

Strategic Intent Scorecard

Multi-dimensional assessment of this application's patent strategy quality, based on claim structure, specification depth, and prosecution positioning.

Claim Breadth
3.8
Prosecution Defensibility
3.5
Spec–Claim Consistency
4.2
Dependent Claim Coverage
2.8
Claim Type Diversity
4.5
Figure Support Quality
4
Breadth Prosecution Consistency Dep. Coverage Claim Types Figures
Key observation: Claim Type Diversity scores highest (4.5/5) because the patent covers all four relevant claim types — system, CRM, training method, and on-vehicle inference method — providing enforcement pathways against system manufacturers, software distributors, training pipeline operators, and vehicle operators alike. Dependent Claim Coverage scores lowest (2.8/5) because Claim 23 (the on-vehicle inference method) carries zero dependent claims and Claim 17 (the CRM) has only one, meaning that if either independent claim is successfully challenged, there are no fallback positions to narrow to — a significant litigation vulnerability practitioners should address in any continuation filing. The practical implication is that a continuation application adding 10–15 dependent claims to Claims 17 and 23 with limitations drawn from the detailed description (e.g., time-series disambiguation, semantic labeling, 3D bounding boxes) would substantially improve the portfolio's invalidity resilience.
See how your own draft compares — Open Eureka IP Drafting →
Critical Gaps

3 Critical Gaps in This Claim Set

A senior-attorney lens on the three highest-priority structural weaknesses — what each exposes in prosecution and litigation, and what a stronger filing would have done differently.

🔒

3 Critical Gaps in This Claim Set

See the full attorney-level analysis of what this application leaves unprotected — and how to draft it more defensively for your own filings.

Zero dependents on Claim 23 Fleet data collection unclaimed Training server apparatus not claimed
Unlock Full Analysis — Free
Frequently asked questions

US 10,956,755 B2 — key questions answered

Still have questions? PatSnap Eureka can answer them from patent data instantly. Search in Eureka
PatSnap Eureka

Ready to Draft Your Next Patent with AI?

PatSnap Eureka's AI drafting agent writes structured claims, flags coverage gaps, and positions your application for prosecution success.

Disclaimer: This analysis is generated by PatSnap Eureka AI based on publicly available patent data from the USPTO. It does not constitute legal advice and should not be relied upon as such. Patent data may be subject to change as prosecution progresses. Scores and assessments reflect automated analysis and may not capture all relevant legal or technical nuances. Always consult a qualified patent attorney for formal legal opinions on patentability, freedom to operate, or infringement.

Ask anything about this patent.
PatSnap Eureka searches patents and data to answer instantly.
Powered by PatSnap Eureka
Link copied to clipboard

Help us improve this page

Found incorrect or outdated information? Let us know and we'll get it fixed.