Book a demo

Cut patent&paper research from weeks to hours with PatSnap Eureka AI!

Try now

Gate-first vs gate-last high-k metal gate integration

Gate-First vs Gate-Last High-k Metal Gate Integration — PatSnap Insights
Semiconductor Technology

Gate-first and gate-last high-k/metal gate integration schemes each solve CMOS scaling’s leakage crisis differently — but their trade-offs in thermal budget, work function control, and EOT scalability determine which approach dominates at each process node. This analysis draws on patents from Texas Instruments, GlobalFoundries, IBM, and Samsung to map the technical decision landscape.

PatSnap Insights Team Innovation Intelligence Analysts 11 min read
Share
Reviewed by the PatSnap Insights editorial team ·

Why SiO₂/Polysilicon Gates Reached Their Limits

The transition to high-k/metal gate (HKMG) architectures became unavoidable once SiO₂ gate oxide thickness fell below approximately 1 nm: at that point, direct quantum mechanical tunneling causes exponentially increasing gate leakage current, driving standby power to unacceptable levels. This fundamental physics constraint — not a manufacturing limitation — forced the industry to find a replacement dielectric that could provide equivalent or lower capacitive coupling (measured as equivalent oxide thickness, or EOT) while being physically thick enough to suppress tunneling.

<1 nm
SiO₂ thickness at which tunneling leakage becomes critical
0.5 nm
Minimum EOT achievable via IL scavenging in replacement gate flows (IBM, 2012)
3.1%
EOT reduction from HfSiO vs. HfSiON at 28 nm (Samsung, 2021)
8+ patents
Active HKMG integration patent families in the dataset

The hafnium-based dielectrics that emerged from roughly a decade of intensive research — reviewed by SEMATECH in a 140-reference survey published in 2006 — can match the electrical performance of conventional SiO₂ while dramatically reducing leakage. Polysilicon gates, however, introduced a second problem: “polysilicon depletion,” a parasitic capacitance effect that increases effective EOT. This made metal gates a necessary companion to high-k dielectrics, as documented by IBM T.J. Watson Research Center in 2006.

The critical challenge introduced by metal gates is work function engineering. As a 2018 review of work function setting in high-k metal gate devices states directly: “In both integration schemes, getting the right work functions and threshold voltages for NMOS and PMOS devices is critical.” This is the central tension that differentiates gate-first from gate-last approaches — and it drives the entire patent and process engineering landscape examined here.

When SiO₂ gate oxide thickness falls below approximately 1 nm, direct quantum mechanical tunneling causes exponentially increasing gate leakage current, making hafnium-based high-k dielectrics — which provide equivalent or lower EOT while being physically thicker — a necessary replacement for continued CMOS scaling.

Equivalent Oxide Thickness (EOT)

EOT is the thickness of SiO₂ that would produce the same gate capacitance as the actual high-k dielectric stack. A lower EOT means stronger electrostatic control of the channel. High-k materials enable physically thicker films with lower EOT than SiO₂ alone, simultaneously reducing tunneling leakage and maintaining gate drive strength.

Gate-First Integration: Simplicity at the Cost of Thermal Budget

Gate-first integration deposits the high-k dielectric and metal gate early in the process flow — before source/drain formation, spacer deposition, and the high-temperature anneals (typically 900–1050°C) required to activate dopants. This mirrors the conventional polysilicon gate process sequence and is therefore more straightforwardly compatible with existing CMOS manufacturing infrastructure.

A canonical gate-first flow, as described in a 2012 US patent from the Institute of Microelectronics at the Chinese Academy of Sciences, proceeds as follows: an ultra-thin interfacial oxide or oxynitride is grown by rapid thermal oxidation; a high-k gate dielectric layer is formed by physical vapor deposition (PVD); rapid thermal annealing follows; then a metal nitride gate is deposited by PVD and selectively doped — P-type dopants for PMOS, N-type dopants for NMOS — by ion implantation using photoresist masks. Polysilicon is then deposited as a capping/hardmask layer, and the full gate stack is patterned by lithography and etching. A 2014 updated US patent from the same institution reinforces this sequence, describing selective ion implantation into the metal nitride gate to differentiate NMOS and PMOS work functions.

“The threshold voltage of devices is highly dependent on not just the deposited material properties but also on subsequent device processing steps.”

The primary advantage of gate-first processing is simplicity: the gate stack is patterned once, and conventional spacer and source/drain process modules follow. The critical drawback, however, is that the metal gate and high-k dielectric must survive all subsequent high-temperature steps. High-temperature source/drain annealing can cause interfacial reactions between the metal gate, the high-k film, and the silicon substrate, shifting flat-band voltages and effective work functions in unpredictable ways — a phenomenon sometimes called Fermi-level pinning or work-function roll-off. This makes achieving the precise mid-gap or band-edge work functions required for both NMOS and PMOS simultaneously very difficult under a single high-thermal-budget gate-first flow.

In gate-first high-k/metal gate integration, the metal gate and high-k dielectric are exposed to source/drain annealing temperatures of 900–1050°C, which can cause interfacial reactions that shift flat-band voltages and effective work functions unpredictably — making simultaneous band-edge work function control for both NMOS and PMOS transistors very difficult.

Gate-first flows remain relevant for emerging channel material integration. A 2022 study from Nanjing Institute of Technology applied a gate-first fabrication process alongside a two-step gate oxide strategy combining self-cleaning atomic layer deposition for InGaAs and ozone post-oxidation for Ge, achieving on-currents up to 8.3 µA/µm. This illustrates that gate-first processing can be adapted when the thermal budget of source/drain processing is tailored to be compatible with fragile high-k stacks on alternative channel materials.

Figure 1 — Gate-First vs. Gate-Last: Thermal Budget Exposure on the Gate Stack
Gate Stack Thermal Budget Exposure: Gate-First vs. Gate-Last High-k Metal Gate Integration 0°C 400°C 700°C 1050°C Max Temp Seen by Gate Stack 900–1050°C Gate-First (S/D anneal exposure) <400°C Gate-Last (BEOL only) Gate-First Gate-Last
Gate-first integration exposes the high-k/metal gate stack to source/drain annealing at 900–1050°C; gate-last replacement gate flows limit gate stack exposure to back-end-of-line temperatures below 400°C after gate fill, enabling far greater work function control.

Explore the full patent landscape for HKMG integration schemes in PatSnap Eureka.

Search HKMG Patents in PatSnap Eureka →

Gate-Last (Replacement Gate): Precision Through Process Inversion

The gate-last approach — also called the replacement metal gate (RMG) process — inverts the sequence: a sacrificial polysilicon dummy gate is formed first, source/drain regions are implanted and annealed at high temperature using the dummy gate as a mask, and then the sacrificial gate is removed by selective etching. The resulting gate trench is then filled with the high-k dielectric and the desired metal gate electrodes. This was the architecture commercialized by Intel at the 45 nm node.

The key benefit is that the high-k dielectric and metal gate are never exposed to the high-temperature source/drain anneal — they see only relatively low back-end-of-line (BEOL) temperatures — giving far greater freedom in selecting metal gate materials with precisely tuned work functions. A Texas Instruments WO patent from 2015 states the motivation clearly: “As the geometries for integrated circuits have scaled to smaller and smaller dimensions, polysilicon transistor gates have been replaced with metal gates to enable scaling to continue to smaller dimensions.” A 2016 EP version and a 2025 EP active version from the same Texas Instruments patent family confirm that the approach enables NMOS and PMOS gate stacks to be independently engineered with separate metal layers.

Work function differentiation in the replacement gate flow is achieved by depositing different metal layers selectively into NMOS and PMOS gate recesses. A 2022 GlobalFoundries patent describes a sophisticated multi-step fill process: a conformal sacrificial metal cap material (SMCM) layer is formed above a high-k gate insulation layer within both NMOS and PMOS replacement gate cavities; the SMCM is selectively removed from one cavity while remaining in the other; then a first conformal metal-containing material layer is deposited; that layer is removed from one cavity; and finally a second conformal metal layer fills both cavities — yielding independently programmed work functions for each transistor type. A 2015 GlobalFoundries patent extends this by driving lanthanide-based material into the high-k layer via thermal processing within the gate recesses, further tuning threshold voltages without exposing the stack to front-end high temperatures.

In gate-last replacement gate integration, work function differentiation between NMOS and PMOS transistors is achieved by sequentially depositing different metal layers selectively into separate gate recesses after source/drain annealing is complete, enabling independent threshold voltage programming for each transistor type without exposing the gate stack to temperatures above approximately 400°C.

EOT scaling in the replacement gate flow remains a challenge even with the thermal freedom it provides. IBM Thomas J. Watson Research Center research from 2012 shows that La-based higher-k materials (k > 20) can achieve EOT as low as 0.5–0.8 nm, but with effective work functions suitable primarily for NMOS. Remote interfacial layer (IL) scavenging — reducing the SiO₂ IL thickness by chemical or thermal means after gate stack deposition — can push EOT below 0.5 nm, but risks loss of effective work function (EWF) control and severe mobility degradation if the IL is eliminated entirely. These EOT–mobility–EWF trade-offs must be resolved within the replacement gate framework for each successive technology node, as documented by IBM researchers.

Figure 2 — EOT Scaling in Replacement Gate Flows: La-based High-k and IL Scavenging
EOT Scaling Targets in Gate-Last Replacement Gate High-k Metal Gate Integration 0 0.6 nm 0.9 nm 1.2 nm EOT (nm) ~1.2 nm Conventional SiO₂ ~0.8 nm HfO₂ Gate-Last 0.5–0.8 nm La-based HK (k>20) <0.5 nm IL Scavenging Gate-Last
IBM (2012) shows that La-based higher-k materials (k > 20) achieve EOT of 0.5–0.8 nm in replacement gate flows; interfacial layer scavenging can push EOT below 0.5 nm, but zero-IL conditions risk severe EWF instability and channel mobility degradation.

Hybrid High-k-First/High-k-Last: The Leading Industrial Solution

Recognizing that neither purely gate-first nor purely gate-last approaches simultaneously optimize NMOS and PMOS device characteristics, Texas Instruments developed and patented a hybrid integration scheme spanning multiple active jurisdictions — US (2016, 2018), EP (2016, 2025), JP (2017), WO (2015), and CA (2015) — making it the most extensively protected HKMG integration architecture in the dataset.

The invention provides “a metal gate NMOS transistor with a high-k first gate dielectric on a high quality thermally grown interface dielectric and with a metal gate PMOS transistor with a high-k last gate dielectric on a chemically grown interface dielectric.” The NMOS device is processed gate-first, benefiting from the superior quality of a thermally grown SiO₂ interfacial layer, which improves channel mobility and interface trap density. The PMOS device is processed gate-last, with the high-k dielectric deposited into the replacement gate recess atop a chemically grown interfacial dielectric.

Key finding: Why NMOS and PMOS need different approaches

The hybrid architecture exploits a fundamental asymmetry: NMOS performance is more sensitive to the quality of the Si/SiO₂ interface (favoring thermally grown gate-first processing), while PMOS work function setting is more sensitive to the metal/high-k interface (favoring the low-temperature replacement gate environment). Matching the integration scheme to each transistor type’s dominant sensitivity is the architectural insight behind Texas Instruments’ hybrid patent family.

TiN work function engineering within this hybrid framework is described in a Texas Instruments EP patent active through 2026: a thick TiN metal gate with work function greater than 4.85 eV is formed for PMOS replacement gate transistors, and a thin TiN metal gate with work function less than 4.25 eV is formed for NMOS replacement gate transistors, while a gate-first TiN option is also provided where both NMOS and PMOS receive the same TiN deposition differentiated by thickness or subsequent doping. This vertical coverage — from process architecture down to individual metal layer specifications — reflects the breadth of Texas Instruments’ intellectual property position in HKMG integration.

The 2018 review of work function setting in high-k metal gate devices, published in academic literature and drawing on data from multiple technology nodes, confirms that both gate-first and gate-last schemes have been implemented in high-volume production, and that the dominant direction at leading-edge nodes (sub-22 nm) has moved toward gate-last or hybrid architectures because of their superior work function control. According to IEEE publications tracking process node evolution, this trend has continued as FinFET and gate-all-around architectures adopted replacement gate flows as a baseline.

Analyse Texas Instruments’ and GlobalFoundries’ full HKMG patent portfolios with PatSnap Eureka’s AI-powered analysis tools.

Explore HKMG Patent Intelligence in PatSnap Eureka →

Gate-Stack Material Engineering at Advanced Nodes

Beyond the gate-first vs. gate-last process topology, advanced nodes require careful gate-stack material engineering to maximize the Ion/Ioff ratio and suppress leakage — and the choices made within an established integration scheme can deliver measurable performance gains without requiring a full process node change.

Samsung Electronics’ 2021 study on 28 nm low-power HK/MG devices provides a direct experimental demonstration: by replacing HfSiON with HfSiO thin films and engineering the TiN layer thickness within the gate stack, the electrical oxide thickness was reduced by 3.1% for NFET devices, improving the Ion/Ioff ratio for a given Ion target while appropriately suppressing gate leakage. This result shows that sub-layer composition engineering — the choice between HfSiO and HfSiON — yields quantifiable performance improvements even within an established gate-last process at a mature node.

Samsung Electronics’ 2021 study on 28 nm low-power HK/MG devices demonstrated that replacing HfSiON with HfSiO thin films and engineering TiN layer thickness reduced electrical oxide thickness by 3.1% for NFET devices, improving the Ion/Ioff ratio without requiring a full process node change.

An important practical constraint emerges when integrating embedded non-volatile memory alongside HKMG logic transistors. A Cypress Semiconductor JP patent from 2019 highlights that at 28 nm and beyond, integrating embedded SONOS-type NVM alongside HKMG logic requires significant additions to the mask set and process steps, because the NVM gate stack must be formed before or after the HKMG stack without disrupting either. This integration complexity is typically easier to manage in gate-last flows, where the late-stage gate fill step can be tailored per device type — a practical consideration that has influenced the adoption of replacement gate flows at foundries offering embedded NVM options.

At the most aggressive EOT targets, the replacement gate’s thermal freedom does not eliminate all scaling trade-offs. As documented by researchers at IBM, zero-IL conditions achieved through metal gate capping layers (e.g., TiN atop HfO₂) introduce severe EWF instability and channel mobility degradation, underscoring that EOT scaling below 0.5 nm remains a multi-variable optimization problem even within the replacement gate framework. These findings are consistent with the broader scaling roadmaps tracked by ITRS and successor organizations, which identify gate stack scaling as a persistent challenge at each successive node.

Head-to-Head: Gate-First vs. Gate-Last at a Glance

The fundamental trade-off between gate-first and gate-last integration is between interface dielectric quality — which favors gate-first for NMOS — and work function and EOT controllability, which favors gate-last for PMOS and aggressive scaling nodes. The table below consolidates the key technical attributes drawn from the patent and literature dataset.

Attribute Gate-First Gate-Last (Replacement Gate)
Process sequence HK/MG deposited before S/D anneal Dummy poly removed; HK/MG deposited into gate recess after S/D anneal
Thermal budget on gate stack High — 900–1050°C S/D anneals Low — BEOL temperatures only, typically <400°C after gate fill
Interface dielectric quality High — thermal oxidation possible Moderate — chemical oxidation or no IL in aggressive scaling
Work function stability Challenging — metals shift during high-T processing More controllable — metals not exposed to S/D anneal
NMOS/PMOS WF differentiation Selective ion implantation into metal nitride gate layer Selective metal fill sequences in NMOS/PMOS recesses
EOT scalability Limited by IL degradation under high-T Enables IL scavenging; EOT <0.5 nm achievable (IBM, 2012)
Process complexity Lower — single lithography for gate patterning Higher — requires dummy gate, CMP planarization, selective removal
Embedded NVM compatibility More straightforward sequential integration More complex; requires careful mask set planning
Hybrid approach NMOS gate-first (thermally grown IL) + PMOS gate-last = best of both — Texas Instruments patent family (US, EP, JP, WO, CA)
Figure 3 — Key Assignees in HKMG Integration Patent Landscape
Key Assignees in Gate-First vs Gate-Last High-k Metal Gate Integration Patent Landscape 0 2 4 6 Patent families / major contributions in dataset Texas Instruments 7 GlobalFoundries 2 Chinese Acad. of Sci. 2 IBM Research 2
Texas Instruments holds the most extensive HKMG integration patent portfolio in the dataset, with 7 active patent families spanning US, EP, JP, WO, and CA jurisdictions covering the hybrid high-k-first/high-k-last replacement gate process; GlobalFoundries and the Chinese Academy of Sciences each contribute 2 major patent families focused on replacement gate and gate-first methods respectively.

The patent landscape reflects clear organizational specialization: Texas Instruments owns the hybrid architecture space; GlobalFoundries drives multi-work-function replacement gate process innovations; the Institute of Microelectronics at the Chinese Academy of Sciences anchors gate-first dual-metal methodology; and IBM and Samsung contribute foundational literature on EOT limits and applied process engineering. Together, these assignees have defined the technical boundaries within which every leading-edge CMOS foundry now operates.

Frequently asked questions

Gate-first vs. gate-last high-k metal gate integration — key questions answered

Still have questions? Let PatSnap Eureka answer them for you.

Ask PatSnap Eureka for a Deeper Answer →

References

  1. The Progress and Challenges of Applying High-k/Metal-Gated Devices to Advanced CMOS Technologies — Texas State University, 2010
  2. Gate stack technology for nanoscale devices — SEMATECH, 2006
  3. Transistor scaling with novel materials — IBM T.J. Watson Research Center, 2006
  4. Work Function Setting in High-k Metal Gate Devices — 2018
  5. Method for integration of dual metal gates and dual high-k dielectrics in CMOS devices (US, 2012) — Institute of Microelectronics, Chinese Academy of Sciences
  6. Method for integration of dual metal gates and dual high-K dielectrics in CMOS devices (US, 2014) — Institute of Microelectronics, Chinese Academy of Sciences
  7. Hybrid high-k first and high-k last replacement gate process (US, 2016) — Texas Instruments Incorporated
  8. Hybrid high-k first and high-k last replacement gate process (US divisional, 2018) — Texas Instruments Incorporated
  9. Hybrid high-k first and high-k last replacement gate process (EP, 2025) — Texas Instruments Incorporated
  10. Hybrid high-k first and high-k last replacement gate process (WO, 2015) — Texas Instruments Japan Limited
  11. Hybrid high-k first and high-k last replacement gate process (EP, 2016) — Texas Instruments Incorporated
  12. Hybrid high-k first and high-k last replacement gate process (JP, 2017) — Texas Instruments Incorporated
  13. Methods of forming an IC product comprising transistor devices with different threshold voltage levels (2022) — GlobalFoundries
  14. Method of forming gate structures with multiple work functions (2015) — GlobalFoundries
  15. Ultimate Scaling of High-k Gate Dielectrics: Higher-k or Interfacial Layer Scavenging? — IBM Thomas J. Watson Research Center, 2012
  16. Gate-Stack Engineering to Improve the Performance of 28 nm Low-Power High-K/Metal-Gate Device — Samsung Electronics, 2021
  17. Process of forming high-k/metal gate CMOS transistors with titanium nitride gates (EP, active through 2026) — Texas Instruments
  18. Integration of memory transistors into a high-k, metal gate CMOS process flow (JP, 2019) — Cypress Semiconductor
  19. Heterogeneous CMOS Integration of InGaAs-OI nMOSFETs and Ge pMOSFETs Based on Dual-Gate Oxide Technique — Nanjing Institute of Technology, 2022
  20. IEEE — Process Node Evolution and FinFET/Gate-All-Around Integration Literature
  21. ITRS / IRDS — International Roadmap for Devices and Systems: Gate Stack Scaling Roadmaps
  22. SEMATECH — Pre-competitive HKMG Research and Gate Stack Technology Reviews

All data and statistics in this article are sourced from the references above and from PatSnap‘s proprietary innovation intelligence platform.

Your Agentic AI Partner
for Smarter Innovation

PatSnap fuses the world’s largest proprietary innovation dataset with cutting-edge AI to
supercharge R&D, IP strategy, materials science, and drug discovery.

Book a demo