Book a demo

Gate-first vs gate-last high-k metal gate integration

Gate-First vs Gate-Last High-k Metal Gate Integration — PatSnap Insights
Semiconductor Technology

The choice between gate-first and gate-last high-k/metal gate integration is one of the most consequential process decisions in advanced CMOS manufacturing — governing thermal budget, work function stability, EOT scalability, and ultimately transistor performance at every node below 45 nm. This analysis draws on patent filings from Texas Instruments, GlobalFoundries, IBM, and the Chinese Academy of Sciences, alongside peer-reviewed literature from SEMATECH, Samsung, and Purdue University.

PatSnap Insights Team Innovation Intelligence Analysts 14 min read
Share
Reviewed by the PatSnap Insights editorial team ·

Why High-k/Metal Gates Became Unavoidable for CMOS Scaling

The transition to high-k/metal gate (HKMG) architectures was forced by a fundamental physics limit: as SiO₂ gate oxide thickness falls below approximately 1 nm, direct quantum mechanical tunneling causes exponentially increasing gate leakage current, raising standby power to unacceptable levels. This finding, established in a 2010 Texas State University review of HKMG devices and corroborated by a 2006 SEMATECH gate-stack review citing 140 references, made continued SiO₂ scaling physically untenable.

The solution is to replace SiO₂ with a physically thicker high-k dielectric — most commonly hafnium-based materials — that provides the same or lower equivalent oxide thickness (EOT) while dramatically reducing leakage. Hafnium-based dielectrics, after roughly a decade of intensive research documented by SEMATECH, achieved electrical performance matching conventional SiO₂ at a fraction of the leakage. Simultaneously, polysilicon gates were found to suffer from “polysilicon depletion” — a parasitic capacitance effect that increases effective EOT — making metal gate electrodes a necessary companion to high-k dielectrics, as noted by IBM’s T.J. Watson Research Center in 2006.

<1 nm
SiO₂ thickness triggering quantum tunneling leakage
<0.5 nm
EOT achievable via IL scavenging in replacement gate flows (IBM, 2012)
3.1%
EOT reduction from HfSiO vs HfSiON at 28 nm (Samsung, 2021)
8
Distinct patent families on gate-first/gate-last HKMG integration in the dataset

The critical challenge introduced by metal gates is work function engineering. As a 2018 review on work function setting in high-k metal gate devices states directly: “In both integration schemes, getting the right work functions and threshold voltages for NMOS and PMOS devices is critical.” This single requirement — setting band-edge work functions independently for both transistor types without destabilising the high-k stack — is the central tension that differentiates gate-first from gate-last process architectures.

Equivalent Oxide Thickness (EOT) defined

EOT is the thickness of SiO₂ that would produce the same gate capacitance as the actual high-k dielectric stack. A lower EOT means stronger electrostatic control of the channel. Reducing EOT without increasing leakage or degrading mobility is the primary design objective of every high-k gate-stack engineering effort.

Gate-First Integration: Simplicity at the Cost of Thermal Exposure

In gate-first HKMG integration, the high-k dielectric and metal gate are deposited early in the process flow — before source/drain formation, spacer deposition, and the high-temperature anneals (typically 900–1050°C) required to activate dopants. This mirrors the conventional polysilicon gate sequence and is therefore more compatible with existing CMOS manufacturing infrastructure.

In gate-first high-k/metal gate integration, the high-k dielectric and metal gate are deposited before source/drain annealing, exposing the gate stack to temperatures of 900–1050°C that can shift effective work functions unpredictably through interfacial reactions between the metal gate, the high-k film, and the silicon substrate.

A canonical gate-first flow, described in a 2012 patent from the Institute of Microelectronics at the Chinese Academy of Sciences, proceeds as follows: an ultra-thin interfacial oxide or oxynitride is grown by rapid thermal oxidation; a high-k gate dielectric layer is formed by physical vapor deposition (PVD); rapid thermal annealing follows; then a metal nitride gate is deposited by PVD and selectively doped — P-type dopants for PMOS, N-type dopants for NMOS — by ion implantation using photoresist masks. Polysilicon is then deposited as a capping and hardmask layer, and the full gate stack is patterned by lithography and etching. A 2014 update from the same institution reinforces this sequence, describing selective ion implantation into the metal nitride gate to differentiate NMOS and PMOS work functions.

Figure 1 — Gate-First vs Gate-Last: Thermal Budget on the Gate Stack
Gate-first vs gate-last high-k metal gate: thermal budget comparison for CMOS integration 0 250 500 750 1050 Temperature (°C) 900–1050°C Gate-First (S/D anneal exposure) <400°C Gate-Last (BEOL only) Gate-First Gate-Last
Gate-first integration exposes the high-k/metal gate stack to source/drain anneal temperatures of 900–1050°C; gate-last (replacement gate) processing limits gate stack exposure to back-end temperatures below 400°C, enabling far greater work function control.

The primary advantage of gate-first processing is simplicity: the gate stack is patterned once, and conventional spacer and source/drain process modules follow without requiring dummy gate formation, chemical mechanical planarization (CMP), or selective gate removal. However, the critical drawback is that high-temperature source/drain annealing can cause interfacial reactions between the metal gate, the high-k film, and the silicon substrate, shifting flat-band voltages and effective work functions in unpredictable ways — a phenomenon sometimes called Fermi-level pinning or work-function roll-off. This makes achieving precise mid-gap or band-edge work functions for both NMOS and PMOS simultaneously very difficult under a single high-thermal-budget gate-first flow.

Gate-first flows remain relevant for emerging channel material integration. A 2022 study from the Nanjing Institute of Technology applied gate-first fabrication to heterogeneous CMOS integrating InGaAs-OI nMOSFETs and Ge pMOSFETs, using a two-step gate oxide strategy — self-cleaning atomic layer deposition for InGaAs and ozone post-oxidation for Ge — and achieved on-currents up to 8.3 µA/µm. This demonstrates that gate-first flows can be viable where source/drain thermal budgets are tailored to be compatible with fragile high-k stacks on alternative channel materials.

Gate-Last (Replacement Gate): Work Function Control at Scale

The gate-last approach — also called the replacement metal gate (RMG) process — inverts the sequence entirely: a sacrificial dummy polysilicon gate is formed first, source/drain regions are implanted and annealed at high temperature using the dummy gate as a mask, and then the sacrificial gate is removed by selective etching. The resulting gate trench is filled with the high-k dielectric and the desired metal gate electrodes. This architecture was commercialized by Intel at the 45 nm node and has since become the dominant approach at leading-edge nodes.

In gate-last (replacement metal gate) HKMG integration, the high-k dielectric and metal gate are deposited into a gate recess after high-temperature source/drain annealing, protecting the gate stack from temperatures above 400°C and enabling independent work function engineering for NMOS and PMOS transistors through selective metal fill sequences.

A Texas Instruments patent family spanning US, EP, JP, and WO jurisdictions — with active patents as recently as 2025 — states the motivation directly: “As the geometries for integrated circuits have scaled to smaller and smaller dimensions, polysilicon transistor gates have been replaced with metal gates to enable scaling to continue to smaller dimensions.” The key benefit is that the high-k dielectric and metal gate never see the high-temperature source/drain anneal, giving far greater freedom in selecting metal gate materials with precisely tuned work functions.

“In both integration schemes, getting the right work functions and threshold voltages for NMOS and PMOS devices is critical” — and it is the gate-last approach’s protection of the metal gate from high-temperature steps that makes precise band-edge work function setting achievable in high-volume production.

Work function differentiation in the replacement gate flow is achieved by depositing different metal layers selectively into NMOS and PMOS gate recesses. A 2022 GlobalFoundries patent describes a sophisticated multi-step fill process: a conformal sacrificial metal cap material (SMCM) layer is formed above a high-k gate insulation layer within both NMOS and PMOS replacement gate cavities; the SMCM is selectively removed from one cavity while remaining in the other; a first conformal metal-containing material layer is deposited; that layer is removed from one cavity; and finally a second conformal metal layer fills both cavities — yielding independently programmed work functions for each transistor type. A 2015 GlobalFoundries patent extends this by driving lanthanide-based material into the high-k layer via thermal processing within the gate recesses, further tuning threshold voltages for both NMOS and PMOS without exposing the stack to front-end high temperatures.

Explore the full patent landscape for high-k/metal gate replacement gate processes with PatSnap Eureka.

Search HKMG Patents in PatSnap Eureka →

EOT scaling in the replacement gate flow presents its own challenges. IBM’s Thomas J. Watson Research Center showed in 2012 that La-based higher-k materials (k > 20) can achieve EOT as low as 0.5–0.8 nm, but with effective work functions suitable primarily for NMOS. Remote interfacial layer (IL) scavenging — reducing the SiO₂ IL thickness by chemical or thermal means after gate stack deposition — can push EOT below 0.5 nm but risks loss of effective work function (EWF) control and severe mobility degradation if the IL is eliminated entirely. These EOT-mobility-EWF trade-offs must be resolved within the replacement gate framework for each successive technology node, according to IBM research.

Hybrid High-k-First/High-k-Last: The Leading Industrial Solution

Recognising that neither purely gate-first nor purely gate-last approaches simultaneously optimise NMOS and PMOS device characteristics, Texas Instruments developed and patented a hybrid integration scheme that applies each approach selectively to the transistor type it best serves. This architectural insight is protected by one of the most extensive HKMG patent families in the dataset, spanning US, EP, JP, WO, and CA jurisdictions with filings starting in 2013 and active protection as recently as 2025.

Texas Instruments’ hybrid high-k-first/high-k-last integration scheme applies gate-first processing to NMOS transistors — using a thermally grown SiO₂ interfacial layer that improves channel mobility and interface trap density — and gate-last (replacement gate) processing to PMOS transistors, where a chemically grown interfacial dielectric and low-temperature metal fill enable precise work function setting.

The invention provides “a metal gate NMOS transistor with a high-k first gate dielectric on a high quality thermally grown interface dielectric and with a metal gate PMOS transistor with a high-k last gate dielectric on a chemically grown interface dielectric.” The NMOS device benefits from the superior quality of a thermally grown SiO₂ interfacial layer, which improves channel mobility and reduces interface trap density. The PMOS device benefits from the low-temperature replacement gate environment, where the metal/high-k interface can be precisely engineered without work-function roll-off from thermal exposure.

Figure 2 — Hybrid HKMG Integration: Process Assignment by Transistor Type
Hybrid gate-first gate-last high-k metal gate integration scheme for CMOS NMOS and PMOS transistors NMOS Thermal SiO₂ Interface Layer High-k Deposition Gate-First (HfO₂) Metal Gate Thin TiN (<4.25 eV) S/D Anneal Gate-First Complete PMOS Dummy Poly Gate S/D Anneal First Dummy Removal Gate Recess Formed High-k + Metal Fill Gate-Last (Chemical IL) Thick TiN WF >4.85 eV
Texas Instruments’ hybrid scheme assigns gate-first processing (thermally grown SiO₂ interface) to NMOS and gate-last replacement gate processing (chemically grown interface, selective metal fill) to PMOS — exploiting the different thermal sensitivities of each transistor type. TiN work function targets per Texas Instruments EP patent active through 2026.

This architectural split exploits a key asymmetry: NMOS performance is more sensitive to the quality of the Si/SiO₂ interface (favouring thermal growth, which gate-first enables), while PMOS work function setting is more sensitive to the metal/high-k interface (favouring the low-temperature replacement gate environment). A Texas Instruments EP patent active through 2026 specifies the work function targets precisely: a thick TiN metal gate with work function greater than 4.85 eV for PMOS replacement gate transistors, and a thin TiN metal gate with work function less than 4.25 eV for NMOS replacement gate transistors.

Key finding: Hybrid architecture dominates leading-edge production

Texas Instruments holds active patents on the hybrid high-k-first/high-k-last replacement gate process in US, EP, JP, WO, and CA jurisdictions — with filings from 2013 and EP protection confirmed as recently as 2025. The breadth of this portfolio, alongside GlobalFoundries’ replacement gate innovations, confirms that hybrid and gate-last architectures are the dominant direction at sub-22 nm nodes.

Gate-Stack Material Engineering and EOT Scaling Limits

Beyond process topology, advanced nodes require careful gate-stack material engineering to maximise the Ion/Ioff ratio and suppress leakage. Samsung Electronics’ 2021 experimental study of a 28 nm low-power HKMG device provides a direct demonstration: by replacing HfSiON with HfSiO thin films and engineering the TiN layer thickness within the gate stack, the electrical oxide thickness was reduced by 3.1% for NFET devices. This improved the Ion/Ioff ratio for a given Ion target while appropriately suppressing gate leakage — all without requiring a full process node change. The finding underscores that sub-layer composition choices within an established gate-last process remain a meaningful performance lever.

Samsung Electronics’ 2021 study demonstrated that replacing HfSiON with HfSiO thin films and tuning TiN layer thickness in a 28 nm low-power HKMG device reduced electrical oxide thickness by 3.1% for NFET devices, improving the Ion/Ioff ratio without a full process node change.

At the most aggressive EOT targets, IBM’s 2012 analysis documents that IL scavenging in a replacement gate context — using metal gate capping layers such as TiN atop HfO₂ — can reduce the interfacial SiO₂ layer to sub-angstrom thicknesses, achieving sub-0.5 nm EOT. However, zero-IL conditions introduce severe effective work function instability and channel mobility degradation, underscoring that the replacement gate’s thermal freedom does not eliminate all scaling trade-offs. According to research published by Nature and affiliated journals on semiconductor scaling, the interplay between dielectric quality, mobility, and leakage remains an active area of investigation at every new node.

Integrating embedded non-volatile memory (NVM) alongside HKMG logic transistors introduces further complexity. A 2019 Cypress Semiconductor patent highlights that at 28 nm and beyond, integrating SONOS-type NVM alongside HKMG logic requires significant additions to the mask set and process steps, because the NVM gate stack must be formed before or after the HKMG stack without disrupting either. This integration complexity is typically easier to manage in gate-last flows, where the late-stage gate fill step can be tailored per device type.

Track EOT scaling innovations and HKMG material patents across all major assignees with PatSnap Eureka.

Analyse Gate-Stack Patents in PatSnap Eureka →

Head-to-Head: Key Attributes of Gate-First vs Gate-Last HKMG Integration

The fundamental trade-off between gate-first and gate-last integration can be summarised across eight technical dimensions. Gate-first offers process simplicity and superior thermal SiO₂ interface quality for NMOS, while gate-last provides the work function controllability and EOT scalability required for PMOS and aggressive scaling nodes. The 2018 work function review confirms that both schemes have been implemented in high-volume production, and the dominant direction at leading-edge nodes (sub-22 nm) has been toward gate-last or hybrid architectures because of their superior work function control — as reflected in the breadth of Texas Instruments’ hybrid patent portfolio and GlobalFoundries’ replacement gate process innovations.

Attribute Gate-First Gate-Last (Replacement Gate)
Process sequence HK/MG deposited before S/D anneal Dummy poly removed; HK/MG deposited into gate recess after S/D anneal
Thermal budget on gate stack High — 900–1050°C S/D anneals Low — BEOL temperatures only, typically <400°C after gate fill
Interface dielectric quality High — thermal oxidation possible Moderate — chemical oxidation or no IL at aggressive scaling
Work function stability Challenging — metals shift during high-T processing More controllable — metals not exposed to S/D anneal
NMOS/PMOS WF differentiation Selective ion implantation into metal nitride gate layer Selective metal fill in NMOS/PMOS recesses (multi-step SMCM process)
EOT scalability Limited by IL degradation under high-T IL scavenging enables EOT <0.5 nm (IBM, 2012)
Process complexity Lower — single lithography for gate patterning Higher — dummy gate, CMP planarization, selective removal required
Embedded NVM integration More straightforward sequential integration More complex; requires careful mask set planning (Cypress, 2019)

The patent data confirms clear assignee concentrations reflecting these trade-offs. Texas Instruments holds the broadest hybrid HKMG portfolio. GlobalFoundries leads in replacement gate multi-work-function innovations. The Institute of Microelectronics at the Chinese Academy of Sciences holds foundational gate-first dual-metal-gate patents from 2012 and 2014. IBM’s T.J. Watson Research Center contributes the most-cited EOT scaling analysis. Samsung provides applied process engineering evidence at 28 nm. SEMATECH, as noted by standards bodies including IEEE, provided influential pre-competitive research that underpins the entire HKMG transition. Together, these players have shaped a technology landscape where gate-last and hybrid architectures dominate at sub-22 nm, while gate-first retains relevance for alternative channel materials and less aggressive thermal budget scenarios.

Frequently asked questions

Gate-first vs gate-last high-k metal gate integration — key questions answered

Still have questions? Let PatSnap Eureka answer them for you.

Ask PatSnap Eureka for a deeper answer →

References

  1. The Progress and Challenges of Applying High-k/Metal-Gated Devices to Advanced CMOS Technologies — Texas State University, 2010
  2. Gate stack technology for nanoscale devices — SEMATECH, 2006
  3. Transistor scaling with novel materials — IBM T.J. Watson Research Center, 2006
  4. Work Function Setting in High-k Metal Gate Devices, 2018
  5. Method for integration of dual metal gates and dual high-k dielectrics in CMOS devices (US, 2012) — Institute of Microelectronics, Chinese Academy of Sciences
  6. Method for integration of dual metal gates and dual high-K dielectrics in CMOS devices (US, 2014) — Institute of Microelectronics, Chinese Academy of Sciences
  7. Hybrid high-k first and high-k last replacement gate process (US, 2016) — Texas Instruments Incorporated
  8. Hybrid high-k first and high-k last replacement gate process (US divisional, 2018) — Texas Instruments Incorporated
  9. Hybrid high-k first and high-k last replacement gate process (EP, 2025) — Texas Instruments Incorporated
  10. Hybrid high-k first and high-k last replacement gate process (WO, 2015) — Texas Instruments Japan Limited
  11. Hybrid high-k first and high-k last replacement gate process (EP, 2016) — Texas Instruments Incorporated
  12. Hybrid high-k first and high-k last replacement gate process (JP, 2017) — Texas Instruments Incorporated
  13. Methods of forming an IC product comprising transistor devices with different threshold voltage levels — GlobalFoundries, 2022
  14. Method of forming gate structures with multiple work functions — GlobalFoundries, 2015
  15. Ultimate Scaling of High-k Gate Dielectrics: Higher-k or Interfacial Layer Scavenging? — IBM Thomas J. Watson Research Center, 2012
  16. Gate-Stack Engineering to Improve the Performance of 28 nm Low-Power High-K/Metal-Gate Device — Samsung Electronics, 2021
  17. Process of forming high-k/metal gate CMOS transistors with titanium nitride gates — Texas Instruments (EP, active through 2026)
  18. Integration of memory transistors into a high-k, metal gate CMOS process flow — Cypress Semiconductor (JP, 2019)
  19. Heterogeneous CMOS Integration of InGaAs-OI nMOSFETs and Ge pMOSFETs Based on Dual-Gate Oxide Technique — Nanjing Institute of Technology, 2022
  20. WIPO — World Intellectual Property Organization: Patent data and CMOS technology filings
  21. IEEE — Institute of Electrical and Electronics Engineers: Semiconductor device and process standards
  22. Nature — Peer-reviewed research on semiconductor scaling and gate dielectric materials

All data and statistics in this article are sourced from the references above and from PatSnap‘s proprietary innovation intelligence platform.

Your Agentic AI Partner
for Smarter Innovation

PatSnap fuses the world’s largest proprietary innovation dataset with cutting-edge AI to
supercharge R&D, IP strategy, materials science, and drug discovery.

Book a demo