Patent Drafting Analysis of Origin Research Wireless’s Radio-Based Voice Activity Detection System | US 12,272,375 B2
Patent Drafting Analysis of Origin Research Wireless's Radio-Based Voice Activity Detection System | US 12,272,375 B2
A structural and strategic analysis of US 12,272,375 B2, examining claim architecture, drafting quality, dependent claim fallback coverage, §101 eligibility exposure, and prosecution positioning for this mmWave-based voice activity detection patent.
Structural Overview
The detailed description dominates at approximately 85% of total words (~34,000 words), reflecting a highly expansive specification that incorporates broad background technical boilerplate characteristic of this applicant's patent family. The claim set comprises 18 claims total — 2 independent (Claims 1 and 18 covering system and method respectively) and 16 dependent claims — yielding an 8:1 dependent-to-independent ratio. The 32 drawing sheets provide extensive visual coverage spanning system overviews, radar feature maps, neural network architectures, signal spectrograms, and performance comparison charts.
Section Word Distribution
↗ Click bars to exploreFigure Inventory — 32 Sheets
| Figure | Description | Role |
|---|---|---|
| FIG. 1 | Overview of speech enhancement and separation (RadioSES) system showing mmWave sensing and acoustic sensing modules feeding a Speech Enhancement and Separation block outputting per-person audio streams.Search in Eureka ↗ | System architecture |
| FIG. 2 | Block diagram of RadioSES system 200 showing mmWave radar (Tx 211, Rx 212), smart speaker 221, source detection and localization block 201, and RadioSESNet deep learning module 202.Search in Eureka ↗ | Key embodiment |
| FIG. 3 | Constant false alarm rate (CFAR) window in range-azimuth space showing cell under test, guard cells, and training cells for target detection.Search in Eureka ↗ | Claim support |
| FIG. 4 | Amplitude map of range-azimuth plane for radio feature extraction showing two speaker locations at approximately 1 m and 1.5 m range.Search in Eureka ↗ | Claim support |
| FIG. 5 | Variance map for radio feature extraction showing the variance of channel impulse response at each range-azimuth bin to identify stationary versus dynamic objects.Search in Eureka ↗ | Claim support |
| FIG. 6 | Detection map for radio feature extraction showing binary detection output at specific range-azimuth bins corresponding to two speaker locations.Search in Eureka ↗ | Claim support |
| FIG. 7 | DBSCAN clustering output showing localized positions of Person I and Person II in range-azimuth space used for speaker number estimation.Search in Eureka ↗ | Claim support |
| FIG. 8A | Unimodal (audio-only) RadioSESNet architecture showing encoder, masker with time-frequency representation, multiplicative masks, and dual decoders producing per-speaker audio outputs.Search in Eureka ↗ | Key embodiment |
| FIG. 8B | Multimodal (audio-radio) RadioSESNet architecture extending FIG. 8A with a parallel radio encoder branch feeding radio features into the masker alongside audio features.Search in Eureka ↗ | Key embodiment |
| FIG. 9 | Detailed RadioSESNet masker structure showing adaptive encoders for radio and audio, Radio/Audio DPRNN blocks, Fusion+DPRNN(×4) multimodal masker, and adaptive decoders outputting Audio Mask 1 and Audio Mask 2.Search in Eureka ↗ | Key embodiment |
| FIG. 10 | DPRNN block workflow showing input reshaping operation, intra-block BLSTM(T) processing, inter-block BLSTM(S) processing, and output block with normalization layers.Search in Eureka ↗ | Key embodiment |
| FIG. 11 | Learning curves comparing audio-radio (AR) system versus audio-only (AO) system across training epochs for clean and noisy separation validation sets.Search in Eureka ↗ | Other |
| FIG. 12 | Scatter plot comparing output SiSDR of audio-radio versus audio-only systems across varying input SiSDR levels showing consistent performance gains of AR approach.Search in Eureka ↗ | Other |
| FIG. 13 | Differential gain (ΔSiSDR) scatter plot showing relative improvement of audio-radio over audio-only system with median improvement line across input SiSDR values.Search in Eureka ↗ | Other |
| FIG. 14A | Experimental setup showing distances (50 cm, 75 cm, 100 cm) from collocated microphone and radar device to seated speakers for separation performance evaluation.Search in Eureka ↗ | Other |
| FIG. 14B | Orientation variation experimental setup showing speaker at 75 cm distance from device at angle θ relative to radar boresight.Search in Eureka ↗ | Other |
| FIG. 14C | Head orientation variation experimental setup showing speaker at 50 cm rotating head at angle θ to test robustness of system to head movement.Search in Eureka ↗ | Other |
| FIG. 15 | Block diagram of Bot 1500 (Type 1 / transmitter device) showing processor 1502, memory 1504, transceiver 1510, synchronization controller 1506, power module 1508, and wireless signal generator 1522.Search in Eureka ↗ | System architecture |
| FIG. 16 | Block diagram of Origin 1600 (Type 2 / receiver device) showing processor 1602, memory 1604, transceiver 1610, synchronization controller 1606, channel information extractor 1620, and motion detector 1622.Search in Eureka ↗ | System architecture |
| FIG. 17 | Flow chart of method 1700 for radio-assisted signal estimation: obtain baseband mixture signal, obtain radio feature, construct adaptive filter, filter mixture signal, generate source estimation.Search in Eureka ↗ | Flow diagram |
| FIG. 18 | System 1800 for radio-assisted signal estimation showing sensor 1810, transmitter 1820 sending first radio signal through wireless channel 1840, receiver 1830 receiving second radio signal, and processor 1835.Search in Eureka ↗ | Key embodiment |
| FIG. 19 | First adaptive filter 1900 architecture with first baseband filter 1910, second baseband filter 1920, third filter 1930, and fourth filter 1940 producing first output signal 1909.Search in Eureka ↗ | Key embodiment |
| FIG. 20 | Detailed diagram of first adaptive filter 1900 showing first pre-processing 2011, first transformation 2012, transformed-domain filters, third pre-processing 2031, fourth transformed-domain filter 2041, and post-processing 2043.Search in Eureka ↗ | Key embodiment |
| FIG. 21A | Microphone spectrogram showing frequency components of background noise, target speech, and interference signals over a 20-second recording period.Search in Eureka ↗ | Claim support |
| FIG. 21B | Radio spectrogram showing frequency components up to 400 Hz captured by mmWave radar, demonstrating radio-only captures target speaker vibration without background noise or interference.Search in Eureka ↗ | Claim support |
| FIG. 21C | Comparative VAD output timeline showing Audio-VAD and Silero-VAD triggering false alarms during interference while Radio-VAD correctly detects only the target speaker's voice activity.Search in Eureka ↗ | Claim support |
| FIG. 22 | Overview diagram of VAD system 2200 with smart device 2210 (radio + mic), mmWave-based VAD 2220, and microphone recording and processing subsystem 2230 showing sound vs. vibration sensing paths.Search in Eureka ↗ | System architecture |
| FIG. 23 | Neural network architecture for radio-based VAD showing Conv32@[2×16] encoder, BiLSTM(dim=1)×4 blocks, LSTM(dim=2) inter-block, fully connected layers, overlap-and-add, downsample, producing VAD output.Search in Eureka ↗ | Key embodiment |
| FIG. 24A | Bar chart comparing Radio-VAD, Audio-VAD, and Silero VAD on accuracy, precision, recall, and F1-score metrics in test set I (closed condition — seen users).Search in Eureka ↗ | Other |
| FIG. 24B | Bar chart comparing Radio-VAD, Audio-VAD, and Silero VAD on accuracy, precision, recall, and F1-score metrics in test set II (open condition — unseen users).Search in Eureka ↗ | Other |
| FIG. 25 | Audio-radio multimodal VAD framework showing dual encoders for radio and audio inputs, concatenation of modalities, DPRNN block, and decoder producing VAD output.Search in Eureka ↗ | Key embodiment |
| FIG. 26 | Flow chart of method 2600 for radio-based VAD: obtain radio signal through wireless channel (2602), compute time series of CI (2604), detect voice activity without any other signal (2606).Search in Eureka ↗ | Flow diagram |
Claim Architecture Analysis
The patent contains 2 independent claims — Claim 1 (system) and Claim 18 (method) — with 16 dependent claims providing a 8:1 dependent-to-independent ratio, significantly above the typical norm of 4–8:1 for wireless communications patents, suggesting meaningful layering of fallback positions. Both independent claims are structured with a transmitter/receiver/processor triumvirate and recite a distinctive VAD limitation — detecting voice activity of a target source 'without using any media signal' — establishing the interference-resilient, radio-only detection as the core novel contribution. Notably absent is a computer-readable medium (CRM) claim type, leaving a significant claim format gap.
Independent Claim Dissection
| Claim | Preamble | Transition | Key Body Elements |
|---|---|---|---|
| Claim 1 | A system for radio-based voice activity detection, | comprising: | transmitter transmitting standard-compliant radio signal (mobile cellular, WLAN, WiFi, IEEE 802.11/802.11bf) through wireless channel; receiver receiving signal where channel is impacted by target voice source and non-target voice source; processor configured to: extract TSCI (CSI/CFR/CIR), perform beamforming, compute directional TSCI (DTSCI), compute time series of radio feature (TSRF), compute radio spectrogram, detect pitch profile with time profile of pitch and harmonics, detect sequence of intermittent voiced/unvoiced speech based on continuous time trend and harmonics, detect voice activity of target separately from non-target voice source without using any media signalSearch prior art ↗ |
| Claim 18 | A method for radio-based voice activity detection, | comprising: | obtaining standard-compliant radio signal transmitted from transmitter to receiver through wireless channel (impacted by target voice and non-target voice, compliant to mobile cellular/WLAN/WiFi/IEEE 802.11/802.11bf); extracting set of TSCI (CSI/CFR/CIR); performing beamforming; computing DTSCI associated with direction of target voice source; computing TSRF from DTSCI; computing radio spectrogram; detecting pitch profile comprising time profile of pitch and harmonics; detecting sequence of intermittent voiced and unvoiced speech based on continuous time trend and harmonics; detecting voice activity of target voice source separately from non-target voice source without using any media signalSearch prior art ↗ |
Claim Dependency Tree
| Metric | This Application | Wireless / Signal Processing Norm |
|---|---|---|
| Total claims | 18 | 15 – 25 |
| Independent claim count | 2 | 2 – 4 |
| Dependent : Independent ratio | 8.00 : 1 | 4 – 8 : 1 |
| Method claims present? | Yes — Claim 18 | Common |
| System / apparatus claims? | Yes — Claim 1 | Always |
Drafting Quality Signals
The patent's primary strength is the precisely engineered 'without using any media signal' negative limitation in Claims 1 and 18, which creates a clear differentiation from prior art audio-based VAD systems and provides a prosecution-tested basis for novelty. A significant structural weakness is the absence of a computer-readable medium (CRM) claim type — the entire dependent claim tree hangs off only two independent claims covering system and method, leaving the software-implementation pathway entirely unprotected.
Strategic Intent Scorecard
Multi-dimensional assessment of this application's patent strategy quality, based on claim structure, specification depth, and prosecution positioning.
3 Critical Gaps in This Claim Set
A senior-attorney lens on the three highest-priority structural weaknesses — what each exposes in prosecution and litigation, and what a stronger filing would have done differently.
3 Critical Gaps in This Claim Set
See the full attorney-level analysis of what this application leaves unprotected — and how to draft it more defensively for your own filings.
US 12,272,375 B2 — key questions answered
Disclaimer: This analysis is generated by PatSnap Eureka AI based on publicly available patent data from the USPTO. It does not constitute legal advice and should not be relied upon as such. Patent data may be subject to change as prosecution progresses. Scores and assessments reflect automated analysis and may not capture all relevant legal or technical nuances. Always consult a qualified patent attorney for formal legal opinions on patentability, freedom to operate, or infringement.
PatSnap Eureka searches patents and data to answer instantly.