Book a demo

Cut patent&paper research from weeks to hours with PatSnap Eureka AI!

Try now

Machine learning fixes TCAD convergence with 122,000× speed

Machine Learning in TCAD Simulation Convergence — PatSnap Insights
Semiconductor Innovation

Machine learning is not replacing TCAD simulation — it is rescuing it. Across more than 20 patents and publications spanning 2014 to 2026, a consistent picture emerges: hybrid ML-TCAD architectures resolve convergence failures, slash iteration counts, and deliver speedups that make next-generation device modeling tractable at advanced technology nodes.

PatSnap Insights Team Innovation Intelligence Analysts 11 min read
Share
Reviewed by the PatSnap Insights editorial team ·

Why TCAD Convergence Fails — and Why ML Is the Fix

Traditional TCAD simulation fails to converge primarily because iterative numerical solvers — particularly Newton-Raphson loops coupling Poisson’s equation with carrier transport equations — accumulate errors across thousands of iterations, and this problem intensifies sharply as device geometries shrink to advanced technology nodes. As documented by Samsung Electronics (2021), the core challenge is twofold: prohibitive computational cost and recurrent convergence errors in three-dimensional TCAD simulation. Critically, the same research explicitly states that complete replacement of TCAD with deep learning has not yet been achieved, framing ML instead as a convergence stabilizer operating within a restructured hybrid system.

122,000×
Max simulation speedup for FinFETs (Korea University, 2022)
R²=0.99
Neural surrogate accuracy on FinFET structures
10⁷×
Faster MOS calibration vs. direct TCAD (Xiangtan University, 2024)
100%
Transient convergence with ISRU activation (NYCU, 2021)

The earliest dataset-verified instance of data-driven convergence acceleration comes from a 2014 patent by Peking University Shenzhen Research Institute. The disclosed method records M historical input-output iteration pairs and uses weighted linear combinations to predict the next iteration input, directly reducing iteration count within semiconductor nanodevice solvers. This approach anticipated what has since become a broad research agenda: using prior simulation history to warm-start iterative solvers rather than initialising from scratch.

The scaling problem has made convergence failures progressively more expensive. At advanced nodes, a single three-dimensional TCAD simulation of a FinFET or gate-all-around structure can require hours of compute time on high-performance clusters, and a failed run — one that diverges before reaching a self-consistent solution — wastes that investment entirely. According to research published by IEEE, iterative solver stability in coupled partial differential equation systems is a well-established numerical challenge, and semiconductor device physics presents some of the most stiff variants of these systems due to the exponential carrier concentration dependencies in Boltzmann statistics.

Samsung Electronics (2021) documented that complete replacement of TCAD simulation with deep learning has not yet been achieved, and instead proposed a hybrid restructuring algorithm that enables deep learning and TCAD to complement each other while fully resolving convergence errors in three-dimensional device simulation.

What is TCAD simulation convergence?

Technology Computer-Aided Design (TCAD) simulation solves coupled partial differential equations — including Poisson’s equation and carrier transport equations — using iterative numerical methods such as Newton-Raphson. Convergence refers to the solver reaching a self-consistent solution within a specified tolerance. Convergence failure occurs when the iterative process diverges or stalls, producing no valid result and wasting all compute time invested in that simulation run.

The dataset spanning 2014 to 2026, drawing on more than 20 patents and peer-reviewed publications from institutions including Samsung Electronics, Synopsys, Gwangju Institute of Science and Technology (GIST), Tsinghua University, Seoul National University, Korea University, and Lam Research, reveals a consistent architectural philosophy: ML as a convergence mechanism, not a replacement solver.

Surrogate Models and Active Learning: Doing More with Fewer Simulations

The fundamental data acquisition problem in ML-TCAD integration is that training a predictive model requires TCAD simulation results, yet each simulation is the expensive resource being minimised. Active learning solves this directly by selecting only the most informative simulation points — and the evidence across the dataset is that this strategy substantially reduces the number of TCAD calls required to train accurate models.

Alsemi (주식회사 알세미), a Korean startup, has filed a cluster of patents (2024–2025) disclosing an active learning framework in which an agent queries TCAD simulations only for voltage-current or voltage-capacitance pairs that maximise model informativeness, then trains a neural network compact model on the returned results. This loop eliminates redundant simulations that would otherwise be required by exhaustive grid sampling. A related 2024 Alsemi patent extends the concept to reinforcement learning agents that optimise semiconductor process variables using the trained neural compact model.

Figure 1 — Active Learning vs. Exhaustive Sampling: Relative TCAD Query Reduction
TCAD Query Strategies for ML Compact Model Training: Active Learning vs. Exhaustive Sampling 0 25 50 75 Relative Query Count (%) 100% Exhaustive Sampling ~30% Active Learning ~20% Meta-Learned TCAD-Assist Exhaustive Active Learning (Alsemi) Meta-Learned TCAD-Assist (TSRI)
Active learning and meta-learned TCAD-assisted sampling substantially reduce the number of TCAD queries required to train accurate neural compact models, compared to exhaustive grid sampling. The meta-learner from Taiwan Semiconductor Research Institute (2022) achieves significantly lower mean square error during the first 100 sampling steps compared to pure ML approaches.

Tsinghua University has extended active learning into multi-objective transistor design optimisation (2025), using Gaussian process regression as a surrogate model and differential evolution to generate candidate process parameter sets. Only candidates with high predicted performance and high uncertainty are forwarded to TCAD for simulation, with results iteratively fed back to update the surrogate. This approach directly targets the Pareto front of transistor performance optimisation, substantially shortening the development cycle.

Taiwan Semiconductor Research Institute (2022) contributed a complementary Bayesian meta-learning approach specifically for semiconductor laser annealing — a process where experimental iteration is prohibitively expensive. The proposed meta-learner adjusts the hybridisation ratio between TCAD and ML when selecting the next sampling point, achieving significantly lower mean square error during the first 100 sampling steps compared to pure ML approaches. This TCAD-assisted meta-learned sampling strategy is directly relevant to fabrication processes where each physical experiment carries substantial cost, as acknowledged by standards bodies including SEMI.

Tsinghua University’s active learning strategy for transistor optimisation (2025) uses Gaussian process regression as a surrogate and differential evolution to generate candidates, forwarding only high-uncertainty, high-performance candidates to TCAD — directly targeting the Pareto front of multi-objective transistor performance and substantially shortening the development cycle.

For process simulation model calibration, Lam Research has filed multiple patent generations on optimising process simulation model parameters by minimising cost functions derived from CD-SEM, optical scatterometry, and transmission electron microscopy measurements. The optimisation loop adjusts pre-process profiles to match post-process metrology, representing a data-driven calibration approach that reduces manual tuning iterations. The company’s sustained investment across multiple patent generations (2020 and 2025) confirms this as a strategic priority in advanced etch and deposition process control.

Explore the full patent landscape for ML-TCAD convergence and surrogate modeling in PatSnap Eureka.

Search ML-TCAD Patents in PatSnap Eureka →

Neural Architecture Choices and Their Convergence Impact

The choice of neural network architecture determines which specific convergence bottleneck is addressed — and the dataset documents at least four distinct architectural paradigms, each targeting a different failure mode in the TCAD simulation pipeline.

Graph Neural Networks: Better Starting Points for Iterative Solvers

The most direct lever for Newton-Raphson convergence acceleration is the quality of the initial solution provided to the solver. Gwangju Institute of Science and Technology (GIST) holds patents (2021, 2023) on graph neural network architectures that derive an area graph from the device structure file, classify device type, and generate a compact-model-derived initial solution. This physically plausible starting point is passed to the TCAD simulator, dramatically reducing the number of Newton-Raphson iterations required to reach convergence. A hierarchical variant employs a higher-level deep neural network to provide the initial solution hint to a lower-level device simulator, creating a coarse-to-fine initialisation hierarchy.

CNNs for Quantum Transport: Accelerating the Most Expensive TCAD Kernel

The nonequilibrium Green’s function (NEGF) method is among the most computationally intensive kernels in TCAD, essential for quantum transport simulation in nanoscale FETs. Kobe University (2020) demonstrated that a convolutional autoencoder trained to predict carrier density and local quantum capacitance distributions from input potential distributions can substitute the NEGF computation within the self-consistent loop alongside Poisson’s equation. The result is accurate potentials across a range of gate lengths in significantly shorter computation time than conventional NEGF — making quantum transport simulation tractable within standard TCAD workflows. This approach is particularly relevant as gate lengths approach and fall below 5 nm, where quantum effects dominate device behaviour, a regime increasingly relevant to roadmaps published by the Semiconductor Industry Association.

Figure 2 — Neural Architecture Applications Across the TCAD Simulation Pipeline
Neural Network Architectures for TCAD Simulation Convergence: GNN, CNN, LSTM, and RNN Applications GNN Initial Solution Generation GIST (2021, 2023) CNN NEGF Quantum Transport Kobe Univ. (2020) LSTM Time-Step Control (PTA) Huada Empyrean (2023) RNN Sequential Process Emulation Samsung (2021) Converged TCAD Result
Four neural architectures address distinct convergence bottlenecks: GNNs provide physically consistent initial solutions (GIST), CNNs substitute expensive NEGF quantum transport kernels (Kobe University), LSTMs control pseudo-transient analysis time steps (Huada Empyrean Software), and RNNs emulate sequential process steps (Samsung Electronics).

LSTM Networks: Preventing Pseudo-Transient Analysis Divergence

Pseudo-transient analysis (PTA) is a widely used continuation algorithm for DC convergence in TCAD-linked circuit simulation. Its most common failure mode is time-step selection: too large a step causes divergence, while an overly conservative step wastes computation. Huada Empyrean Software (2023) proposes an LSTM-based method for controlling the time step in PTA, exploiting the LSTM’s temporal memory to predict future time steps from the history of the simulation trajectory. A coarse-and-fine-grained hybrid sampling strategy identifies the optimal time step, and the LSTM network predicts forward to avoid both divergence and excessive conservatism. This directly addresses one of the most common convergence failure modes in both TCAD and SPICE simulation engines.

“Neural network surrogate models for FinFET simulation achieve R² = 0.99 accuracy while delivering speedups exceeding 122,000× compared to traditional simulators — a benchmark that establishes the practical viability of ML-TCAD hybrid deployment.”

Recurrent Networks for Sequential Process Emulation

Samsung Electronics (2021) disclosed a simulation system built around a recurrent neural network consisting of process emulation cells arranged in time series. Each cell receives the previous output profile, target profile, and process condition information, generating the current output profile using time-series causal relationships encoded as prior knowledge. This architecture mirrors the sequential physics of multi-step semiconductor manufacturing — each process step depends on the output state of the preceding step — enabling step-by-step profile prediction that bypasses TCAD at each individual process step while preserving the causal dependencies that make the predictions physically meaningful.

Kobe University (2020) demonstrated that a convolutional autoencoder trained to predict carrier density and local quantum capacitance distributions from input potential distributions can substitute the computationally intensive NEGF (nonequilibrium Green’s function) kernel within the TCAD self-consistent loop, yielding accurate potentials across a range of gate lengths in significantly shorter computation time than conventional NEGF simulation.

Optimising Simulator Settings and Compact Model Convergence

Beyond replacing TCAD kernels, machine learning has been applied to optimising the configuration of TCAD simulators themselves — mesh density, solver tolerance, stepping strategy — parameters that critically affect both convergence and accuracy without changing the underlying physics equations being solved.

Synopsys (2022) filed the most direct patent expression of this concept: an ML framework that predicts optimised TCAD simulator system settings for future simulation executions. By learning from past simulation runs — which settings led to convergence and which did not — the system preemptively configures the solver for new simulations. This transforms simulator configuration from a manual expert task into a learned, data-driven process, and represents a natural extension of the company’s broader EDA toolchain strategy.

Samsung Electronics (2024) addressed the specific problem of regression model prediction failure through an adaptive retraining loop. When the consistency of a trained neural network model falls below a target threshold, the system identifies consistency reduction factors — regions of the input space where the model fails — and retrains on additional sample data targeted at those regions. This is conceptually equivalent to active learning but applied to model consistency rather than output uncertainty, and it ensures that deployed neural models remain reliable across the full operating range of device parameters.

Key finding: ISRU activation achieves 100% SPICE convergence

Research from National Yang-Ming Chiao-Tung University, Taiwan (2021) evaluated sigmoid, tanh, ReLU, and ISRU (inverse square root unit) activation functions in ML compact models. ISRU activation achieved 100% convergence in transient analysis, substantially outperforming all other activations. Including gate-to-source and gate-to-drain capacitances in the model further improved SPICE convergence — a direct design specification for ML compact models intended for integration into SPICE engines.

The impact of activation function selection on SPICE convergence is a finding with direct practical implications for how ML compact models should be architected. The ISRU activation function maintains smooth, bounded derivatives across the full input range, which prevents the gradient discontinuities that cause Newton-Raphson divergence in SPICE engines — a property that sigmoid and tanh share in principle but that ReLU lacks entirely due to its non-differentiability at zero. This analysis, published by researchers at National Yang-Ming Chiao-Tung University and consistent with convergence theory reviewed by ACM, has direct implications for compact model standardisation efforts.

Xiangtan University (2024) reported a machine learning-based MOS transistor simulation model calibration method that combines a classifier, regressor, and surrogate model to calibrate TCAD parameters, reporting computation speed approximately 10⁷ times faster than direct TCAD simulation. The method trains a random forest model to replace expensive TCAD calls during the iterative calibration loop, demonstrating that ensemble methods — not just deep learning — can deliver dramatic acceleration for specific calibration tasks.

National Yang-Ming Chiao-Tung University (2021) found that ISRU (inverse square root unit) activation in machine learning compact models achieves 100% convergence in transient SPICE analysis, substantially outperforming sigmoid, tanh, and ReLU activations — with gate-to-source and gate-to-drain capacitances in the model providing additional convergence benefit.

Track semiconductor simulation patents from Samsung, Synopsys, Lam Research, and emerging players in real time.

Analyse TCAD Innovation in PatSnap Eureka →

Key Players, Patent Landscape, and Innovation Trajectory

Samsung Electronics and Synopsys are the most prolific patent filers in ML-TCAD convergence, while academic contributors from Korea, Taiwan, and China dominate the peer-reviewed literature — a bifurcation that reflects the different incentives driving industrial IP protection versus academic publication.

Figure 3 — Patent and Publication Activity by Organisation in ML-TCAD Convergence (2014–2026)
ML-TCAD Simulation Convergence: Patent and Publication Activity by Organisation 2014–2026 0 1 2 3 4 Number of Patents / Publications in Dataset Samsung Electronics 4 Synopsys 2 Alsemi Co., Ltd. 3 GIST 2 Lam Research 2 Academic (KR/TW/CN) 6+ Industrial Patent Filer Startup/Institute Patent Academic Publications
Samsung Electronics leads industrial patent filing, with Alsemi representing the most focused startup effort. Academic institutions from Korea, Taiwan, and China collectively contribute the largest body of peer-reviewed evidence across the dataset of 20+ sources spanning 2014–2026.

Samsung Electronics is the dominant patent filer, with contributions spanning semiconductor design automation systems that integrate ML with TCAD simulation data, neural network-based semiconductor characteristic prediction, recurrent process simulation, and adaptive retraining loops for neural network model consistency. Its 2021 foundational paper on 3D TCAD restructuring establishes the hybrid complementarity thesis that the rest of the field has largely adopted.

Synopsys contributes both the ML-optimised simulator settings patent (2022) and an ML-driven IC design prediction framework (EP, 2024), as well as a 2019 machine learning-assisted DTCO (Design-Technology Co-Optimisation) modeling framework — establishing it as the leading EDA vendor in this space and consistent with its broader strategy of embedding intelligence into simulation toolchains, as tracked by PatSnap’s IP intelligence platform.

Alsemi has filed a cluster of active learning patents for neural compact model generation (2024–2025), including a reinforcement learning agent for process variable optimisation. This represents a focused Korean startup effort targeting the TCAD-to-compact-model pipeline — a commercially valuable bottleneck between process simulation and circuit design.

From a trend perspective, the dataset shows clear progression: from simple neural network surrogates trained offline (2019–2021), through active learning and adaptive retraining loops (2022–2023), toward multi-agent reinforcement learning and large language model integration for parameter optimisation (2024–2025). This trajectory indicates rapid expansion of the ML-TCAD solution space, with each generation addressing limitations identified in the prior one. The PatSnap Insights blog tracks emerging innovation across the semiconductor design automation landscape.

The ML-TCAD patent landscape shows progression from offline neural network surrogates (2019–2021) through active learning and adaptive retraining loops (2022–2023) toward multi-agent reinforcement learning and large language model integration for parameter optimisation (2024–2025), with Samsung Electronics and Synopsys as the most prolific industrial patent filers across the dataset of 20+ sources.

Frequently asked questions

Machine learning in TCAD simulation — key questions answered

Still have questions? Let PatSnap Eureka answer them for you.

Ask PatSnap Eureka for a Deeper Answer →

References

  1. Restructuring TCAD System: Teaching Traditional TCAD New Tricks — Samsung Electronics Data and Information Technology Center, 2021
  2. Method and computing device for improving the generation efficiency of neural compact models — Alsemi Co., Ltd., 2025
  3. Active learning methods and computing devices for neural compact models — Alsemi Co., Ltd., 2024
  4. Machine learning method and framework for optimizing setups for accurate, speedy and robust TCAD simulations — Synopsys, Inc., 2022
  5. Circuit convergence study using machine learning compact models — National Yang-Ming Chiao-Tung University, Taiwan, 2021
  6. Meta-Learned and TCAD-Assisted Sampling in Semiconductor Laser Annealing — Taiwan Semiconductor Research Institute, 2022
  7. Accelerating DC Circuit Simulation through Feature Selection and LSTM-Based Time-Step Control — Huada Empyrean Software Co., Ltd., 2023
  8. Simulator acceleration and inverse design of fin field-effect transistors using machine learning — Korea University, 2022
  9. Semiconductor device simulation system and semiconductor device simulation method — Gwangju Institute of Science and Technology, 2023
  10. Electronic device, method, and computer readable medium for simulation of semiconductor device — Gwangju Institute of Science and Technology, 2021
  11. Acceleration of nonequilibrium Green’s function simulation for nanoscale FETs by applying convolutional neural network model — Kobe University, 2020
  12. Modeling method of neural network for simulation in semiconductor design process — Samsung Electronics Co., Ltd., 2024
  13. Method of predicting characteristic of semiconductor device and computing device performing the same — Samsung Electronics Co., Ltd., 2023
  14. Semiconductor design automation system and computing system including the same — Samsung Electronics Co., Ltd., 2023
  15. Method for efficiently optimizing process variables in semiconductor devices using neural compact model — Alsemi Co., Ltd., 2024
  16. Process simulation model calibration using CD-SEM — Lam Research Corporation, 2025
  17. Machine-learning driven prediction in integrated circuit design — Synopsys, Inc., 2024
  18. Simulation system for semiconductor process and simulation method thereof — Samsung Electronics Co., Ltd., 2021
  19. Active learning strategy-based transistor key characteristic optimization method and system — Tsinghua University, 2025
  20. A machine learning-based MOS transistor simulation model calibration method — Xiangtan University, 2024
  21. A rapid simulation method for semiconductor nanodevices based on historical information — Peking University Shenzhen Research Institute, 2014
  22. IEEE — Institute of Electrical and Electronics Engineers
  23. Semiconductor Industry Association (SIA)
  24. ACM — Association for Computing Machinery
  25. SEMI — Global Semiconductor Industry Association

All data and statistics in this article are sourced from the references above and from PatSnap‘s proprietary innovation intelligence platform.

Your Agentic AI Partner
for Smarter Innovation

PatSnap fuses the world’s largest proprietary innovation dataset with cutting-edge AI to
supercharge R&D, IP strategy, materials science, and drug discovery.

Book a demo