Silicon Nanowire Tunneling Field-Effect Transistor Arrays: Improving Subthreshold Performance Using Excimer Laser Annealing

Joshua Thomas Smith  
*Purdue University*, jtsmith@purdue.edu

Christian Sandow  
*Peter Gruenberg Institut 9 (PGI-9-IT)*, sandow@netdust.de

Saptarshi Das  
*Purdue University*, sdas@purdue.edu

Renato A. Minamisawa  
*Peter Gruenberg Institut 9 (PGI-9-IT)*, r.a.minamisawa@fz-juelich.de

Siegfried Mantl  
*Peter Gruenberg Institut 9 (PGI-9-IT)*, s.mantl@fz-juelich.de

*See next page for additional authors*
Silicon Nanowire Tunneling Field-Effect Transistor Arrays: Improving Subthreshold Performance Using Excimer Laser Annealing

Joshua T. Smith, Christian Sandow, Saptarshi Das, Renato A. Minamisawa, Siegfried Mantl, Member, IEEE, and Joerg Appenzeller, Fellow, IEEE

Abstract—We have experimentally established that the inverse subthreshold slope $S$ of a Si nanowire tunneling field-effect transistor (NW-TFET) array can be within 9% of the theoretical limit when the doping profile along the channel is properly engineered. In particular, we have demonstrated that combining excimer laser annealing with a low-temperature rapid thermal anneal results in an abrupt doping profile at the source/channel interface as evidenced by the electrical characteristics. Gate-controlled tunneling has been confirmed by evaluating $S$ as a function of temperature. The good agreement between our experimental data and simulation allows performance predictions for more aggressively scaled TFETs. We find that Si NW-TFETs can be indeed expected to deliver $S$-values below 60 mV/dec for optimized device structures.

Index Terms—Excimer laser annealing (ELA), nanowire tunneling field-effect transistor (NW-TFET), steep-slope transistors, ultrathin-body silicon-on-insulator (SOI).

I. INTRODUCTION

SINCE the first experimental demonstration of sub-60 mV/dec operation [1], tunneling field-effect transistors (TFETs) have been extensively studied on a variety of material systems to address the demand for low-power device operation not achievable with conventional metal–oxide–semiconductor field-effect transistors (MOSFETs) [2]–[6]. For this purpose, detailed studies have been dedicated to the importance of using a 1-D geometry, such as a nanotube and nanowire (NW), to access the so-called quantum capacitance limit (QCL) [7]–[9], wherein proper drain voltage dependence is ensured in the device’s $ON$-state [10]—a feat that is not easily achievable in 2- and 3-D systems. Despite this fact, a relatively small number of studies exist on Si NW TFETs (NW-TFETs) at present [11]–[14]. Those that do typically employ vapor–liquid–solid (VLS) grown NWs that use Au catalyst materials not compatible with current manufacturing processes and have the added challenge of NW placement [11]–[13]. Ideally, Si NW-TFET arrays would be created in a top–down approach to combine the inherent scaling advantages of NWs and their capability to operate in the QCL with a complementary metal–oxide–semiconductor (CMOS) compatible process flow.

The only critical component missing then is the ability to maintain the abruptness of the doping profile at the source/channel interface for these NW-TFET arrays. A major problem that the semiconductor industry at large is facing today is precisely this issue of maintaining the abruptness of channel doping profiles in MOSFETs as dimension scale [15]. Traditional rapid thermal anneal (RTA) processing used to activate implanted dopants can lead to unacceptable diffusion in ultrathin body semiconductors, which is a problem that is exacerbated in a NW geometry [16]. To overcome this limitation, researchers have explored alternate doping techniques and diffusionless activation processes such as plasma doping (PLAD) and excimer laser annealing (ELA), respectively [17], [18]. Whereas abrupt source/drain doping profiles are important for conventional MOSFETs, they are crucial in achieving optimized band-to-band tunneling (BTBT) devices such as TFETs, whose $ON$-state currents $I_{ON}$ and inverse subthreshold slopes are ultrasensitive to the abruptness of this junction. To our knowledge, the use of ELA for enhanced BTBT devices has not been explored yet.

Herein, we present the first systematic study on top–down Si NW-TFET arrays comparing RTA, ELA, and low-temperature RTA (LT-RTA) activation and recrystallization processes to realize significantly improved BTBT behavior as measured by $I_{ON}$ and $S_{avg}$, which is the average extracted inverse subthreshold slope over three orders of magnitude change in drain current $I_D$. Simulations using the Wenzel–Kramer–Brillouin (WKB) approximation are used to compare the results with ideally expected $S_{avg}$-values, showing good agreement with the ELA + LT-RTA process and allowing the prediction of performance for aggressively scaled TFETs.

II. EXPERIMENT

NW-TFET arrays consisting of 400 NWs in parallel were fabricated on silicon-on-insulator (SOI) substrates
with a Si body thickness of 20 nm and a 100-nm-thick buried oxide (BOX) layer. Wire arrays were defined using electron-beam lithography to achieve a nominal pitch and wire width of 100 and 20 nm, respectively (see Fig. 1). Dry oxidation was used to form a gate oxide thickness of \( \sim 6 \) nm, as confirmed by cross-sectional transmission electron microscope imaging (not shown). The \( n^+ \) poly-Si gated channel region was patterned to occupy 500 nm of the 1 \( \mu \)m total wire length, leaving \( \sim 250 \) nm for the \( n^+ \)-source and \( p^+ \)-drain extension regions. The source and drain regions were implanted at energies of 5 (As) and 1.5 keV (B) with a 0° implant angle and dose of \( 1 \times 10^{15} \) cm\(^{-2} \). The implant energies were carefully chosen with simulation to ensure the presence of a sufficient seed crystal at the bottom of the NWs. A detailed description of the NW-TFET array fabrication process used in this paper was recently reported by Sandow et al. [14].

To evaluate the impact of different activation processes on TFET performance, three techniques were investigated: RTA, ELA, and ELA + LT-RTA. For RTA, a standard spike anneal at 1000 °C was used to activate the dopants and repair the damage induced by implantation. The ramp-up rate for the RTA process was 100 °C/s, and the ramp-down rate was 80 °C/s. Submelt ELA processing was performed using a KrF excimer laser (\( \lambda_{ex} = 248 \) nm) with a top-flat beam profile of \( 7 \times 7 \) mm\(^2\). The pulse duration and number of pulses was held constant at 28 ns and 100, respectively. The energy fluence was tuned in the range of 45–75 mJ/cm\(^2\) to optimize tunneling performance. The best results were obtained at 60 mJ/cm\(^2\); therefore, we restrict our discussion in this paper to this energy fluence. It is important to note that using an energy fluence level corresponding to the submelt process regime is the key to maintaining the abruptness of the as-implanted profile. Previous studies that focus on the application of ELA to NWs grown via the VLS method reported that Si NW melting occurred in the range of 60–70 mJ/cm\(^2\) in the form of beaded periodic chains of nanoparticles due to morphological instabilities [19]. Similar melting effects were not observed in our NW arrays formed on SOI at these energy levels, possibly due to the absence of Au and other morphological defects found in grown NWs that lower the melting threshold. Finally, this ELA process was combined with a 5-min 650 °C LT-RTA treatment to further improve doping abruptness at the tunnel junction. The temperature was chosen in an attempt to reduce the depth of the amorphized region and enhance the crystalline quality while minimizing both diffusion into the unimplanted Si regions (i.e., in-diffusion) and dopant deactivation [20], [21].

Electrical characterization of the TFET arrays was carried out by negatively biasing the \( p^+ \)-drain with respect to the \( n^+ \)-source to permit the staircase p-i-n structure depicted in Fig. 2. In this arrangement at a sufficiently high negative gate voltage \( V_{gs} \), the valence band in the channel is lifted above the conduction band in the source, and interband tunneling occurs at the source/channel junction, giving rise to p-type transistor behavior.

III. RESULTS AND DISCUSSION

A. Components of TFET Operation

To gain a qualitative understanding of the impact of the various geometry and material-related parameters on tunneling performance, we invoke the Landauer expression for a 1-D ballistic device, given by [22]

\[
I_D = 2 \cdot \frac{q}{h} \int_{0}^{\Phi_0} dE \cdot T(E) \left( f_s(E) - f_d(E) \right). \tag{1}
\]

To achieve reasonably high \( I_{ON} \) currents, \( T(E) \) should then be as close to unity as possible. For TFETs, the transmission can be estimated using the WKB approximation for a triangular
barrier, as shown in Fig. 2 band diagram, and is expressed as [23]

$$T_{WKB} \approx \exp \left( - \frac{4\Lambda \sqrt{2m^*E_g^3/2}}{3\hbar}(\Delta \Phi + E_g) \right),$$  \tag{2}

Fig. 3. Transfer characteristics showing tunneling currents in the p- and n-branches for three annealing conditions: (circles) 1000 °C RTA spike anneal, (squares) ELA at an energy fluence of 60 mJ/cm², and (triangles) ELA at the same conditions followed by a 650 °C LT-RTA step for 5 min. Overlapping $I_D$ and $I_S$ currents indicate the absence of gate leakage. (Inset) Extracted $S_{avg}$ vs. $T$ values over 3 orders magnitude change in $I_D$, which demonstrate < factor of 2 variation from 4 to 300 K and much larger $S_{avg}$ values than for the $KT/q$ behavior (dotted line) expected for thermal emission at lower temperatures.

Here, $q$, $h$, $E_g$, and $m^*$ represent the charge of an electron, the reduced Planck’s constant, the band gap of the material, and the transverse effective mass, respectively. The energy window $\Delta \Phi$ is the difference between the surface potential in the channel and the conduction band edge in the source, as depicted in Fig. 2, and is directly related to $V_{gs}$. Furthermore, $\Delta \Phi$ is the only operation-dependent parameter in (2). Since the remaining parameters are material dependent or physical constants, the screening length $\Lambda$ or the region over which the potential varies spatially, offers the only option in which $T_{WKB}$ can be modified for a given material choice. A decrease in $\Lambda$ leads to a larger tunneling probability, which in turn results in an effectively reduced $S_{avg}$ value. It is this link between $S_{avg}$ and $\Lambda$ that will be explored in greater detail in this paper.

B. Role of Doping Abruptness in TFETS

Given that maintaining a minimum $\Lambda$ is crucial for optimized tunneling in a particular material, a closer look at the factors that contribute to this parameter is instructive. Ideally, $\Lambda$ has two major components, namely, the screening length that arises from the depletion of the doped source region at the source/channel interface $\lambda_{dop}$ and screening from the intrinsic channel region $\lambda_{ch}$, which results from the selection of device dimensions and geometry. More specifically, $\Lambda$ is ideally defined with the following analytic expression:

$$\Lambda = \lambda_{dop} + \lambda_{ch}$$

$$= \sqrt{\frac{\varepsilon_0 \varepsilon_{body} \cdot kT}{q^2 N_D}} + \sqrt{\frac{\varepsilon_{body} E_{ox}^2}{\varepsilon_{body} - \frac{q^2}{8} \ln \left(1 + \frac{2\varepsilon_{ox}}{d_{body}}\right)}}$$  \tag{3}

for a coaxial gate geometry, where $\varepsilon_0$, $\varepsilon_{body}$, $E_{ox}$, $d_{body}$, $d_{ox}$, $N_D$, $k$, and $T$ represent the permittivity of free space, relative permittivity of the semiconductor and oxide, body thickness, oxide thickness, doping concentration in the source, Boltzmann’s constant, and operating temperature, respectively. To avoid degradation of $S_{avg}$, the Fermi level in the source should be kept between 50–100 meV above the conduction band [24], corresponding to a doping level in Si of $N_D \approx 8 \times 10^{19} - 2 \times 10^{20}$ cm⁻³. Thus, at room temperature, $\lambda_{dop} \approx 0.5$ nm and $\lambda_{dop} \ll \lambda_{ch}$ even for aggressively scaled devices.

Simply using (3) then gives the impression that the only components necessary for achieving a minimum $\Lambda$ are aggressive scaling and high-κ dielectrics. Although these are indeed necessary ingredients, these modifications alone do not necessarily sufficiently minimize $\Lambda$. The primary reason for this is that (3) assumes an infinitely abrupt doping profile at the source/channel interface. In reality, even an as-implanted doping profile for a given implant energy produces a finite “smearing” of the lateral dopant profile into the channel region, or $\Delta \lambda_{dop}$, that inhibits the grip of the channel potential over this region, effectively increasing the tunneling distance. Simulations have shown that even in planar structures, the steepness of the lateral doping profile falls off at ~3 nm/dec for As and ~5 nm/dec for B when a standard RTA 1000 °C spike anneal is used for activation [25]. Given that there is ~10-dec difference in doping at the source/channel junction for an ideal Si TFET, the degradation of $\Lambda$ and, ultimately, $S_{avg}$ may be significant if junction abruptness cannot be reasonably maintained. For this reason, alternative activation processes are explored in this paper to improve junction steepness and, consequently, $S_{avg}$.

C. Characterization of Disparately Activated NW-TFETS

The transfer characteristics for the RTA spike, ELA, and ELA + LT-RTA-annealed NW-TFETs are displayed in Fig. 3 at $V_{ds} = -0.5$ V. We briefly note that all of the devices were tested in the range of $V_{ds} = -0.05 \text{ to } -3$ V and displayed the same general trends between different activation conditions over this entire voltage range. Furthermore, noting that the purpose of a steep-slope device is to allow the transition between the ON- and OFF-states to occur over a smaller gate voltage window and that $I_{ON}$ for an optimized device is defined at $V_{gs} = V_{ds} = \text{supply voltage} V_{dd}$ [26], the choice of $|V_{ds}| = 0.5$ V was made to achieve the best possible device performance at a $V_{ds}$ value actually suitable for low-power applications. $I_D$ and $I_S$ are both included in Fig. 3 to indicate the absence of any appreciable gate leakage, which is a necessary component to ensure that the extracted subthreshold slope is, in fact, accurately represented and not artificially low as a result of $I_D$ leaking from the channel to the gate in the OFF-state. This is particularly...
important deep into the OFF-state where gate leakage can be significant and where local subthreshold slopes are often reported. Whereas a threshold voltage $V_{th}$ shift occurs between the devices that use different means of activation, device-to-device variation of $V_{th}$ for a given activation technique was found to be negligible.

Tunneling occurs in both the p- and n-branches; however, we restrict our analysis to the p-branch since, in principle, higher activation and steeper doping profiles are possible with As, compared with B doping. The inset in Fig. 3 shows the observed $S_{avg}$ vs. temperature $T$ over 3 orders of magnitude change in $I_D$ for the ELA + LT-RTA device. Always the same trend was observed—a relatively temperature independent $S_{avg}$ from 4–50 K and 150–300 K with some temperature dependence showing up in the range of 50–150 K. A number of factors can potentially contribute to this temperature-dependent range, even in a tunneling device, such as freeze out of trap-assisted tunneling centers or quenching of phonons required for tunneling through the indirect Si band gap [27], [28]. Clearly, however, the $S_{avg}$ variation is far below the predicted value for thermal emission over a gate-controlled barrier that tends toward zero, as indicated by the dotted line. In our experiment, $S_{avg}$ only changes by a factor of 1.45 to 1.65, regardless of the number of decades change in $I_D$ considered, over the entire temperature range from 4 to 300 K.

The key performance-related features to note are $I_{ON}$ and $S_{avg}$, where we define $I_{ON}$ at 1 V above threshold and $S_{avg}$ over 3 orders of magnitude change in $I_D$ from $10^{-12}$ to $10^{-9}$ A to provide a reasonable $I_{ON}/I_{OFF}$ ratio for digital applications. To more clearly illustrate the comparison of $S_{avg}$ for the different activation processes, the p-branch current in Fig. 3 is re-plotted in Fig. 4, with the characteristics shifted in $V_{gs}$ to align at $I_{OFF} = 10^{-12}$ A. $I_{ON}$ and $S_{avg}$ both demonstrate a clear and significant trend of improvement, depending on the activation process used. Here, ELA + LT-RTA shows the greatest enhancement followed by ELA and, finally, the RTA spike anneal. The extracted values are summarized in Table I.

![Fig. 4. Transfer characteristics shifted in $V_{gs}$ to align the p-branches of RTA, ELA, and ELA + LT-RTA annealed NW-TFET devices at $I_{OFF} = 10^{-12}$ A. (Dotted lines and large dots) The extraction points for $S_{avg}$ over three orders magnitude change in $I_D$.](image)

### Table I

<table>
<thead>
<tr>
<th>Process</th>
<th>$S_{avg}$ (mV/dec)$^a$</th>
<th>$V_{th}$ (V)</th>
<th>$I_{ON}$ (mA)$^b$</th>
</tr>
</thead>
<tbody>
<tr>
<td>RTA</td>
<td>601</td>
<td>-3.5</td>
<td>15.2</td>
</tr>
<tr>
<td>ELA</td>
<td>452</td>
<td>-2.9</td>
<td>21.5</td>
</tr>
<tr>
<td>ELA + LT-RTA</td>
<td>382</td>
<td>-3.0</td>
<td>77.0</td>
</tr>
</tbody>
</table>

$^a$ $S_{avg}$ values are extracted over three orders of magnitude current from $I_{OFF} = 10^{-12}$ A to $I_D = 10^{-9}$ A. $^b$ $I_{ON} = I_D$ at $|V_{th} - V_{th}| = 1$ V

![Fig. 5. Cross-sectional schematics depicting the expected relative levels of activation and dopant profile smearing at the source/channel junction under the gate for the activation conditions described in this paper, including: (a) as-implanted; (b) 1000 °C spike RTA; (c) 650 °C SPER only; (d) ELA only; and (e) ELA + 650 °C LT-RTA.](image)

**D. Performance Dependence on the Activation Process**

To gain a better understanding of the trends for the extracted figures of merit, namely $I_{ON}$ and $S_{avg}$, a qualitative description of the expected doping behavior, in terms of activation and junction steepness, during the various processes is necessary. Typically, the degree of activation and cross-sectional area of current flow are considered as key ingredients for achieving high $I_{ON}$ values in conventional MOSFETs. This logic alone, however, is not sufficient to explain the current in tunneling devices since a 1000 °C RTA spike anneal should, in principle, favorably compare in terms of $I_{ON}$ from this perspective. By contrast, TFETs are electrically very sensitive to $\Lambda$, as evidenced by the results presented in Fig. 4 and discussed in Sections III-A–C; therefore, over and above the level of activation and cross-sectional area of current flow, lateral doping abruptness at the source/channel junction is paramount in tunneling devices. Ultimately, it is the competition and interplay between these physical manifestations that results in the observed characteristics.

Fig. 5 provides a simplified picture of the expected activation levels and doping steepness for the different processes. Certain features have been exaggerated for clarity in the foregoing arguments. The as-implanted profile shown in Fig. 5(a) is expected to exhibit a large degree of amorphized Si with a very
low level of activation [20]. The existence of a seed crystal at the bottom of the NW is crucial in avoiding higher defect densities that arise from complete amorphization [29]. Fig. 5(b) depicts the situation for the RTA spike anneal, where significant diffusion toward the unimplanted Si regions at the bottom of the NW and laterally into the channel region is expected [30]. This has the effect of increasing the cross-sectional area of current flow through the NW while achieving reasonably high levels of activation but at the expense of a significant “smearing” of the dopant profile at the source/channel interface, which effectively increases $\Lambda$ by reducing the abruptness of the transition between the source and the gated channel region, as discussed in Section III-B. By contrast, it is known that a single 650 °C solid-phase epitaxial regrowth (SPER) anneal, which is akin to the LT-RTA process, results only in dopant diffusion toward the implanted surface [20]. During this process, a fraction of the As dopants are “snow ploughed” toward the Si/gate SiO$_2$ interface, while many of the As dopants at the same time relocate onto Si lattice sites within the implanted region and become electrically active during the recrystallization process. Importantly, this diffusion of dopants toward the Si/gate SiO$_2$ interface and negligible in-diffusion act to sharpen the borders of the doping profile beyond the as-implanted profile, as pictured in Fig. 5(c), creating a box-like dopant profile at the border of the implanted/unimplanted region. This process then effectively minimizes $\Lambda$, enhancing the probability of tunneling transmission at the source/channel interface. Additionally, SPER is a nonequilibrium process that has demonstrated the ability to overcome the solid-solubility limit imposed on the conventional RTA process [31]. Activation using submelt ELA is also capable of achieving activation levels in excess of the solid-solubility limit and doping profiles with only subtle variations in steepness from the as-implanted profile, as displayed in Fig. 5(d) [18]. At first, this process alone may appear to provide an adequate option; however, similar to the SPER case, this nonequilibrium method leads to a metastable state where deactivation can occur with subsequent thermal treatments between 500 °C–800 °C common in CMOS processing [32], [33]. Furthermore, laser activation treatments, such ELA, are known to suffer from poorer local atomic order [30]. This combined with the smaller activated cross section can potentially lead to a relatively large series resistance in a NW geometry, which may be responsible for the comparatively pronounced saturation of $I_D$ deeper into the ON-state of the ELA only device.

By combining ELA and LT-RTA [see Fig. 5(e)], an optimal balance between lateral profile steepness, activation, and cross-sectional current flow may be realized, promoting optimized tunneling through the source/channel barrier. Several detailed studies have been dedicated to the benefits of using laser annealing (LA) in conjunction with RTA processing, compared with an RTA spike anneal or LA alone [21], [30], [34]. Submelt ELA promotes high activation with minimal diffusion. In this way, many As dopants are already active on substitutional sites prior to LT-RTA. The subsequent LT-RTA treatment, which clearly improves $I_{ON}$ and $S_{avg}$ in the TFET arrays, may enhance tunneling through a number of mechanisms, including a sharpened dopant profile, improved crystallinity, minimization of the depth of the amorphous region, and/or higher activation. Further investigation is needed to quantify and isolate the effects of each of these potential contributing factors. In any case, the selection of 650 °C for LT-RTA should provide access to the “sweet spot” where deactivation is minimized and in-diffusion is negligible [20], [21]. The outcome is a highly activated source doping with a sharp source/channel junction, as electrically verified by the results presented in Fig. 4 and Table I. Further optimization of the LT-RTA process may be possible by using smaller time durations, although there is still some debate as to whether deactivation occurs during recrystallization of the amorphous layer or in succession to this process for SPER [35], [36]. In the present case, where ELA has already recrystallized much of the NW, even smaller times than are typically used in SPER processing may be sufficient. Regardless, if the LT-RTA time duration can be properly tuned, further optimization of TFET performance is possible.

**IV. SIMULATION AND SCALING**

The evolution in device performance for the activation techniques presented herein demonstrates an unambiguous trend of improvement in moving from the standard RTA process to a combination of ELA and LT-RTA; however, this raises the broader question of whether this process enhancement is sufficient enough to ultimately achieve a sub-60 mV/dec $S_{avg}$ with aggressive scaling.

In order to answer this question, a model that adequately captures the key contributing factors to device operation over a wide range of $d_{body}$ and $d_{ox}$ is essential. To accomplish this task, we employ a more encompassing version of the Landauer formalism presented in (1), wherein, for a given channel potential $\Phi_f$, the number of contributing modes $M(E)$, is analytically determined by considering the mode spacing that arises due to size quantization in the NW, rather than simply assuming a single mode, as in (1). In this way, the appropriate number of modes is assigned for a given NW diameter at a given value of $\Phi_f$. In addition to size quantization effects, our model also considers the influence of $V_{gs}$ on $\Phi_f$ to capture the transition from the classical limit into the QCL. In a well-behaved fully depleted device, the total gate capacitance $C_q$ is given by $C_q = C_{ox}C_{avg}$, where $C_q$ is the quantum capacitance or change in channel charge with changing surface potential [9]. The relationship between $V_{gs}$ and $\Phi_f$ is given by

$$\delta \Phi_f = \frac{C_{ox}}{C_{ox} + C_q} \cdot q\delta V_{gs}.$$  (4)

In 1-D systems, the QCL can be reached, where $C_q < C_{ox}$, and the gate capacitance is dominated by $C_q$. Consequently, $\delta \Phi_f/q\delta V_{gs}$ approaches unity even in the devices ON-state. In our model, the actual band movement is accounted for by determining $C_q$ for each $V_{gs}$ using the expression

$$C_q = q^2 \delta \Phi_f \int_0^\infty D(E) \cdot T_{\text{WKB}}(E)$$

$$\times (f_S(E,T) + f_d(E - qV_{ds},T)dE.$$  (5)
Importantly, the transmission probability that appears in (5) involves BTBT. This is in contrast to the conventional MOSFET case where transmission is only a function of scattering. The result is that the QCL can be more easily accessed in a device that employs tunneling since the smaller tunneling probability lowers $C_q$, compared with $C_{ox}$ for a given set of $d_{ox}$ and $d_{body}$. Using (2) and (3), we calculated $q$ by (5) followed by the modified Landauer expression in an iterative fashion to generate $I_D-V_{gs}$ curves. Since the gating arrangement of the top–down TFETs is more accurately described by an Ω-gate structure rather than a fully coaxial gate geometry, $\lambda_{ch}$ in (3) was adjusted by a factor of $\sqrt{4/3}$ to account for the fact that only 75% of the channel circumference is covered by the gate. Using this approach, $S_{avg}$ was determined by averaging the slope of these curves over 3-dec change in $I_D$ from $10^{-12}$ to $10^{-15}$ A to match the experimental data for comparison.

The results are represented by the solid line in Fig. 6. At $d_{body} = 20$ nm, the simulated curve has a value of 350 mV/dec, compared with 382 mV/dec for the dual process ELA + LT-RTA device, a ∼9% deviation in contrast to the ∼72% variation for the RTA spike annealed sample. By adding $\Delta \lambda_{dop}$ to (3), the effective physical smearing of the dopant profile at the source/channel interface was estimated for each of the activation processes by adjusting $\Delta \lambda_{dop}$ until a match was obtained between the simulated and experimentally extracted $S_{avg}$ values shown in Table I. The outcome was $\Delta \lambda_{dop}$–RTA ≈ 9.2 nm $> \Delta \lambda_{dop}$–ELA $> \Delta \lambda_{dop}$–ELA + LT–RTA ≈ 1.1 nm. These values assume that the degradation of the subthreshold slopes is primarily due to the dopant profile smearing problem. In reality, a number of other factors can potentially contribute to the deviation in $S_{avg}$ as well, including interface trap and parasitic fringe capacitance contributions. Given that similarly fabricated NW MOSFETs demonstrated an $S \sim 64$ mV/dec (close to the thermal limit), we rule out any significant capacitance contributions and attribute the majority of the departure of $S_{avg}$ from the ideal expected values to dopant smearing due to the activation and implant processes. One fortunate outcome is that, contrary to most scaling processes, the smearing of the source/channel doping profile will improve as NW dimensions scale, and smaller implant energies are required, resulting in a tighter lateral distribution that yields a smaller $\Lambda$. In other words, $S_{avg}$ is expected to get closer to the ideal theoretical limit as the NW diameter scales.

Based on the fact that the choice of $d_{body}$ and $d_{ox}$ used in this paper are not sufficient to reach the sub-60 mV/dec range, further scaling measures are obviously required. It was found that a combination of aggressively scaled high-$k$ dielectrics and small NW diameters are required to access sub-60 mV/dec inverse subthreshold slopes (see dotted curve in Fig. 6). To simulate this curve, the following assumptions were made: 1) 3-dec change in $I_D$ with $I_{OFF}$ adjusted from $10^{-12}$ to $3.3 \times 10^{-13}$ A (reliable data is available for the ELA + LT-RTA device at this current level, as shown in Figs. 3 and 4); 2) a gate-all-around (GAA) geometry; and 3) a 3-nm-thick HfO$_2$ gate dielectric with a conservatively estimated dielectric constant of ∼15. This curve was only extended to 7 nm to reflect the fact that below this diameter, $E_g$ increases in a 1/$d_{body}$ fashion due to quantization effects [37]. Here, a projection of $S_{avg} \approx 39$ mV/dec is possible at $d_{body} = 7$ nm, a substantial improvement over the thermal limit, with sub-60 mV/dec operation possible for $d_{body} < 13$ nm. This result is encouraging in the sense that the feasibility of creating such Si NW arrays has already been experimentally demonstrated [38] and suggests in combination with our findings a route toward Si-based steep-slope devices.

V. CONCLUSION

$S_{avg}$ has been experimentally extracted from top–down NW-TFET arrays that were fabricated employing processing methods that are compatible with state-of-the-art CMOS techniques. By using ELA in conjunction with an LT-RTA activation process to preserve doping abruptness at the source/channel tunneling interface, a significant improvement in $S_{avg}$ and $I_{ON}$ were achieved. $S_{avg}$ for ELA + LT-RTA was shown to be within ∼9% of the theoretical limit for a perfectly abrupt doping profile. Furthermore, our simulations have shown that future work using this activation process in conjunction with NW-TFETs that have a smaller diameter, GAA architecture, and HfO$_2$ gate dielectric will enable the realization of sub-60 mV/dec operation on a Si platform.

ACKNOWLEDGMENT

The authors would like to thank the Birck Nanotechnology Center and Peter Gruenberg Institute for structural support.

REFERENCES


Joshua T. Smith received the B.S. degree in electrical engineering from Arizona State University, Tempe, AZ, in 2004. He is currently working toward the Ph.D. degree in electrical engineering at Purdue University, West Lafayette, IN.

In 2005, he was a Product Engineer with ON Semiconductor. Early during his Ph.D. program from 2006 to 2007, he worked on templating processes for the integration of carbon-nanotube-based devices into nanoelectronic platforms. His current research focuses on experimental realization of low-power devices on Si and SiGe nanowire platforms.

Mr. Smith was a recipient of the 2007 National Science Foundation Graduate Research Fellowship Award.
Saptarshi Das was born in Kolkata, India, in 1986. He received the B.S. degree from Jadavpur University, India. He is currently working toward the Ph.D. degree in electrical engineering at Purdue University, West Lafayette, IN.

In the summer of 2009, he worked as an Intern with IBM T. J. Watson Research Center. His research focuses on device and circuit modeling and optimization of novel nanotransistors for radio frequency applications. He also works on the experimental realization of ferroelectric low-power transistors.

Christian Sandow received the M.S. degree in physics from the University of Bonn, Bonn, Germany in 2005 and the Ph.D. degree in physics from the Technical University of Aachen, Aachen, Germany, in 2010. His Ph.D. dissertation investigated the fabrication and modeling of silicon nanowire tunnel field-effect transistors (FET).

Since 2010, he has been with PNSensor GmbH, in Munich, Germany, as a Research Staff Member. His current research interests include modeling and fabrication of active silicon pixel detectors (depleted P-FET) as well as transport in nanostructures, particularly steep-slope devices for future low-power device applications.

Dr. Sandow was the recipient of a nine-month research scholarship from the German Academic Exchange Service (DAAD) to conduct a portion of his Ph.D. research at Purdue University, West Lafayette, IN, from 2008 to 2009.

Renato A. Minamisawa received the B.Sc. degree in medical physics from the University of São Paulo, Ribeirão Preto, Brazil, in 2005 and the M.Sc. degree in physics from the Alabama Agricultural and Mechanical University (AAMU), Huntsville, in 2008. He is currently working toward the Ph.D. degree in physics at the Rheinisch-Westfälische Technische Hochschule Aachen University, Jülich, Germany.

From 2006 to 2008, he was with the Center of Irradiation of Materials, AAMU, working on ion beam nanofabrication of high-performance porous membranes and on the fabrication and characterization of thermoelectric devices. In 2008, he joined the Jülich Research Center, Jülich, where his current research focuses on the fabrication of strained silicon by ion implantation of thin virtual substrates, on the doping of strained Si nanowires on insulator, and on the fabrication and investigation of carrier transport in nanoscale strained silicon–germanium channel complementary metal–oxide–semiconductor devices with novel high-κ gate dielectrics.

Mr. Minamisawa is a Student Member of the Material Research Society. He received the honor mention award at the Simpósio Internacional de Iniciação Científica da Universidade de São Paulo, Brazil, in 2003, and the Alabama Experimental Program to Stimulate Competitive Research Program Graduate Research Scholar Award in 2008.

Joerg Appenzeller (M’02–SM’04–F’10) received the M.S. and Ph.D. degrees in physics from the Technical University of Aachen, Aachen, Germany, in 1991 and 1995, respectively. His Ph.D. dissertation investigated quantum transport in low-dimensional systems based on III/V heterostructures.

He worked for one year as a Research Scientist with the Research Center Jülich, Jülich, Germany, before he became an Assistant Professor with the Technical University of Aachen, in 1996, working on electronic transport in carbon nanotubes and superconductor–semiconductor hybrids. From 1998 to 1999, he was with the Massachusetts Institute of Technology, Cambridge, as a Visiting Scientist, exploring the scaling limits of Si metal–oxide–semiconductor field-effect transistors. From 2001 to 2007, he was with the IBM T. J. Watson Research Center, Yorktown Heights, NY, as a Research Staff Member, where he was mainly involved in the investigation of carbon nanotubes and silicon nanowires for future nanoelectronics. Since 2007, he has been a Professor of electrical and computer engineering with Purdue University, West Lafayette, IN, and the Scientific Director of Nanoelectronics with the Birck Nanotechnology Center. His current interests include novel devices based on nanomaterials such as nanowires, nanotubes, and graphene.

Siegfried Mantl (A’97–M’04) received the Ph.D. degree in physics from the University of Innsbruck, Innsbruck, Austria, in 1976.

Since 1971, he has been with the Research Center Jülich, Jülich, Germany. He is currently the Head of the Ion-Beam Technique Division of the Peter Gruenberg Institute 9, Research Center Jülich, and a Professor of physics with Aachen University of Technology (RWTH Aachen). His research interests include the investigation of Si- and Ge-based nanoelectronic devices, ion beam techniques, and various thin film growth methods. Specifically, nanowires and strained heterostructures for novel transistors, such as short-channel field-effect transistors, Schottky barrier, and tunnel metal–oxide–semiconductor field-effect transistors are under investigation. He is the author or coauthor more than 250 journal articles, book chapters, and review articles and the holder of over 20 patents.