

# Tuning Resistive Switching Behavior by Controlling Internal Ionic Dynamics for Biorealistic Implementation of Synaptic Plasticity

Sangmin Yoo, Yuting Wu, Yongmo Park, and Wei D. Lu\*

Memristive devices have demonstrated rich switching behaviors that closely resemble synaptic functions and provide a building block to construct efficient neuromorphic systems. It is demonstrated that resistive switching effects are controlled not only by the external field, but also by the dynamics of various internal state variables that facilitate the ionic processes. The internal temperature, for example, works as a second-state variable to regulate the ion motion and provides the internal timing mechanism for the native implementation of timing- and rate-based learning rules such as spike timing dependent plasticity (STDP). In this work, it is shown that the 2nd state-variable in a  $Ta_2O_5$ -based memristor, its internal temperature, can be systematically engineered by adjusting the material properties and device structure, leading to tunable STDP characteristics with different time constants. When combined with an artificial post-synaptic neuron, the 2nd-order memristor synapses can spontaneously capture the temporal correlation in the input streaming events.

# 1. Introduction

The exponential growth of integrated circuits' density and functionality due to transistor scaling has recently slowed down as the device size approaches sub-10 nm.<sup>[1]</sup> To meet the demands of modern applications such as artificial intelligence, autonomous vehicles, and Internet-of-Things, advanced devices and computing architectures are needed to achieve efficient largescale data analysis and storage in real time. Memristive devices offer high scalability, non-volatile storage, and rich switching dynamics that make them appealing candidates for in-memory computing and neuromorphic computing systems.<sup>[2–4]</sup> A standard memristor is a two-terminal device with a switching medium between the two electrodes (**Figure 1**a). It stores data in the form of different conductance values, as a result of internal ion (e.g., oxygen vacancy  $V_0$ ) re-distribution in the medium driven by the externally applied electric field.<sup>[5,6]</sup> Meanwhile,

S. Yoo, Y. Wu, Y. Park, W. D. Lu Department of Electrical Engineering and Computer Science The University of Michigan Ann Arbor, MI 48109, USA E-mail: wluee@umich.edu

The ORCID identification number(s) for the author(s) of this article can be found under https://doi.org/10.1002/aelm.202101025.

DOI: 10.1002/aelm.202101025

Joule heating induced during the process exponentially increases the drift velocity and the diffusivity of oxygen vacancies, and thus plays a critical role in the resistive switching (RS) process.<sup>[7]</sup> In our previous work,<sup>[8]</sup> we have demonstrated that the internal device temperature T can be considered as a 2nd state-variable, which though not directly measurable can strongly affect the 1st state-variable (the size of the oxygen vacancy-based filament) evolution and the RS characteristics. Since the internal temperature rises during programming and spontaneously decays after removal of the applied pulses, heat can be accumulated from multiple pulses, depending on how closely the pulses are applied. Thus, the device conductance change is not only determined by the present pulse but also by the relative timing of the pulses in the near history, allowing

these so-called 2nd-order memristive devices to natively implement important synaptic plasticity effects such as spike timing dependent plasticity (STDP). Essentially, the spontaneous decay of the internal temperature can be regarded as playing a similar role as the Ca<sup>2+</sup> concentration in a biological synapse<sup>[9]</sup> to decode spike timing information, making it possible to achieve bio-realistic implementation of neuromorphic systems.<sup>[10]</sup>

However, previous studies on 2nd order memristors lack controllability of the 2nd state-variable. Instead, different shapes of signals<sup>[3,11,12]</sup> or additional heating pulse were used to achieve the desired STDP function.<sup>[8,13]</sup> In this work, through detailed physical modeling and experiments, we study methods to control the 2nd state-variable (internal temperature) and its effects on the switching dynamics of a 2nd-order memristor, including how the STDP time constant can be modified. When using the natively implemented STDP learning rule for unsupervised learning in a spiking neural network,<sup>[14]</sup> we show the 2nd-order devices can naturally uncover the correlation pattern in input spiking events.

# 2. Experimental Section

#### 2.1. Device Model

The COMSOL Multiphysics model was developed following the previous work,<sup>[15]</sup> and extended it to 3D device structures.







**Figure 1.** Schematic device structures for a device a) without and b) with the HI layers. c) Top view of a fabricated HI-device. d) Oxygen vacancy and e) temperature profile of the two cases, showing the device states immediately after forming. f) Experimentally measured DC *I–V* curves of the two types of devices. g) Simulated DC *I–V* curves of the two types of devices from the COMSOL Multiphysics model. Analog conductance modulation results through consecutive pulses for the h) standard device and i) HI-device.

This allowed to tune the electrode material properties and the structural dimension to study their effects on the internal dynamics of the 2nd-order memristor. It was assumed that the top and bottom electrodes were effective heat sinks so that 1  $\mu$ m away from the device the temperature of the electrode reached room temperature. The memristive devices and interconnecting wires were assumed to be surrounded by silicon dioxides. The peak temperature value in the switching layer was obtained from the temperature profile simulation at given time instances. Material properties, including heat capacity, thermal conductivity, and mass density were chosen from reported thin film values from literature.<sup>[16–18]</sup> The thermal conductivity was obtained using the following reported model as a function of T:<sup>[19]</sup>

$$k = 0.2T + 4(W/(m \cdot K))$$
 (1)

Two separate physical structures were used as schematically shown in Figure 1a,b to model devices with and without the NiCr-based HI layer, respectively. Other physical assumptions were identical to the previous work.<sup>[15]</sup> The parameters used are listed in **Table 1**.

 $\ensuremath{\textbf{Table 1.}}$  Material properties used in the COMSOL Multiphysics simulation.

| Material | Specific Heat [J kg <sup>-1</sup> K <sup>-1</sup> ] | Thermal conductivity<br>[W m <sup>-1</sup> K <sup>-1</sup> ] | Density[kg m <sup>-3</sup> ] |  |
|----------|-----------------------------------------------------|--------------------------------------------------------------|------------------------------|--|
| NiCr     | 380                                                 | 17                                                           | 7750                         |  |
| Pd       | 240                                                 | 71.2                                                         | 1202                         |  |

#### 2.2. Device Fabrication

TaO<sub>x</sub> based memristors with different areas were fabricated through photolithography. The layout for the standard devices included memristors with sizes from 1 to 100 µm<sup>2</sup>. Device fabrication started with 35 nm Pd bottom electrode deposition by photolithography, e-beam evaporation, and lift-off. It was followed by sputtering of 30 nm TaO<sub>x</sub> using a Ta metal target in an Ar/O<sub>2</sub> gas mixture (3% O<sub>2</sub>) at 400 °C. The pressure of the gas was  $\approx 5$  mTorr. A 4-nm Ta<sub>2</sub> O<sub>5</sub> switching layer was then deposited by RF sputtering using a Ta<sub>2</sub> O<sub>5</sub> ceramic target in Ar with a pressure of  $\approx 5$  mTorr. Top electrode was deposited in the same way as the bottom electrode.<sup>[8,20]</sup> For the HI-device, 40 and 70 nm NiCr HI layers were added before the deposition of the bottom electrode and after that of the top electrode, respectively. The thickness of the top electrode was increased to 60 nm to ensure continuous connection over the step edges of the added HI layer.

#### 2.3. Correlation Detection Simulation

The inputs had the same average firing rate (*r*) of ~500 Hz. Group 1 (neuron 0–9) had the pairwise correlation parameter (*c*) of around 0.1, and group 2 (neuron 10–19) had correlation of around 0.2. The spiking events of the other 80 neurons were uncorrelated. The generation of the correlated spiking inputs followed the methods described in ref. [21]. The correlated spike trains  $X_k(t)$  were produced by conditioning their firing probabilities on the activity of a common reference spike train  $X_O(t)$ . The equations of the conditional probabilities are shown below.

$$\vartheta = P\left(X_{k}\left(T\right) = 1 \mid X_{o}\left(T\right) = 1\right) = r\Delta T + \sqrt{c}\left(1 - r\Delta T\right)$$
(2)

www.advancedsciencenews.com

 $\frac{dV_{m}\left(t\right)}{dt}=$ 



(8)

$$\varphi = P(X_k(T) = 1 | X_o(T) = 0) = r\Delta T (1 - \sqrt{c})$$
(3)

The pairwise correlation between two generated spike trains  $X_i(t)$  and  $X_j(t)$  can be calculated as following, which is equal to *c*.

$$E[X_{i}X_{j}] = \sum_{m=0,1} P(X_{i}(T) = 1 | X_{o}(T) = m) * P(X_{j}(T) = 1 | X_{o}(T) = m)$$

$$= \vartheta^{2}r\Delta T + \varphi^{2}(1 - r\Delta T) = (r\Delta T)^{2} + c(r\Delta T)(1 - r\Delta T)$$
(4)

$$Corr\left(X_{i}, X_{j}\right) = \frac{\operatorname{cov}\left(X_{i}, X_{j}\right)}{\sqrt{\operatorname{Var}\left(X_{i}\right)\operatorname{Var}\left(X_{j}\right)}}$$

$$= \frac{E\left[X_{i}X_{j}\right] - E\left[X_{i}\right]E\left[X_{j}\right]}{\sqrt{\left(E\left[X_{i}^{2}\right] - E\left[X_{i}\right]^{2}\right)\left(E\left[X_{j}^{2}\right] - E\left[X_{j}\right]^{2}\right)}}$$

$$= \frac{(r\Delta T)^{2} + c(r\Delta T)(1 - r\Delta T) - (r\Delta T)^{2}}{(r\Delta T)(1 - r\Delta T)} = c$$
(5)

The artificial neuron was modeled as a leaky-integrate-and-fire (LIF) neuron. The membrane potential was updated every 2  $\mu$ s according to Equation (6), where  $\tau_{\rm m}$ = 100  $\mu$ s and the threshold voltage was set as 5.

A multiplicative STDP learning rule was used for numerical simulation of conductance updates, where the parameters were fitted to the measurement data (area =  $1 \,\mu m^2$  with HI layer) in **Figure** 2e. The parameters were set as following:  $A_p = 0.23$ ,  $A_d = 0.23$ ,  $\tau_p = 56.3 \,\mu$ s,  $\tau_d = 123.2 \,\mu$ s,  $\eta = 0.01$ , stdp\_window = 200  $\mu$ s.

$$w = \begin{cases} w - \eta * w * A_{d} * \exp\left(\frac{\Delta t + 100ns}{\tau_{d}}\right), -200\mu s \le \Delta t < 0\\ 0, \Delta t = 0 \\ w + \eta * (1.0 - w) * A_{p} * \exp\left(\frac{\Delta t - 100ns}{\tau_{p}}\right), 0 < \Delta t \le 200\mu s \end{cases}$$
(7)

# 3. Results and Discussions

As discussed earlier,<sup>[8]</sup> a memristor's 2nd state variable can play a critical role to the evolution of the 1st state variable and the resulting RS characteristics. For example, the internal temperature *T* exponentially changes the ion diffusivity (*D*) and drift velocity ( $\nu$ ):

$$-\frac{v_{n}}{t_{m}} + \sum_{i=1}^{n} w_{i} X_{i}(t)$$

$$(6) D = \frac{1}{2} \cdot a^{2} f \exp\left(-\frac{E_{s}}{kT}\right)$$

$$(9) D = \frac{1}{2} \cdot a^{2} f \exp\left(-\frac{E_{s}}{kT}\right)$$

$$(9) D = \frac{1}{2} \cdot a^{2} f \exp\left(-\frac{E_{s}}{kT}\right)$$

$$(9) D = \frac{1}{2} \cdot a^{2} f \exp\left(-\frac{E_{s}}{kT}\right)$$

**Figure 2.** Visualization of the internal heat dissipation dynamics for a) an HI-device and b) a standard device. The temperature profile was calculated at different time instances after the removal of a programming pulse. c) Internal peak temperature evolutions for the two devices obtained through COMSOL simulation. The peak temperature was recorded immediately after the removal of a programming pulse. d) STDP characteristics measured from the standard and the HI-device with  $1 \mu m^2$  area, respectively. e) STDP characteristics measured from the HI-device having different areas.



ADVANCED SCIENCE NEWS \_\_\_\_\_ www.advancedsciencenews.com

where *f* is the escape-attempt frequency, *a* is the effective hopping distance, and  $E_a$  is the activation energy for V<sub>O</sub> migration.<sup>[15]</sup> These parameters and temperature gradient drive the oxygen vacancy drift/diffusion, as described below:

$$\frac{\mathrm{d}n_{\mathrm{D}}}{\mathrm{d}t} = \nabla \cdot \left( D \nabla n_{\mathrm{D}} - \nu n_{\mathrm{D}} + D S_{\mathrm{n}_{\mathrm{D}}} \nabla T \right) \tag{10}$$

where  $n_{\rm D}$  is the local oxygen vacancy density,  $D\nabla n_{\rm D}$  and  $vn_{\rm D}$  are the Fick diffusion flux and the drift flux terms, and  $DS_{n_{\rm D}}\nabla T$ corresponds to the Soret diffusion effect, respectively.<sup>[15,22]</sup> Thus, a memristor's RS characteristics are strongly affected by the internal Joule heating and heat dissipation dynamics, which can be mathematically described as

$$\frac{\mathrm{d}Q}{\mathrm{d}t} = -\frac{kA(T_1(t) - T_2)}{d} = c\rho \frac{\mathrm{d}T(t)}{\mathrm{d}t} \tag{11}$$

where *k* is the thermal conductivity, A is the area of the device, *d* is the distance between two temperature points  $T_1$  and  $T_2$ , *c* is the specific heat, and  $\rho$  is the material density (Figure 1a). Here  $T_1$  represents the switching region temperature, and can be calculated as a function of *t*:

$$T_{1}(t) = (T_{1}(0) - T_{2}) \exp\left(-\frac{kA}{c\rho d}t\right) + T_{2}$$
(12)

where  $T_2$  is the ambient temperature. From Equation (12), the time *t* required for  $T_1$  to reach a specific temperature can be derived:

$$t = \ln\left(\frac{T_1(0) - T_2}{T_1(t) - T_2}\right) \frac{c\rho d}{kA}$$
(13)

where the dimensions of the device structure and the intrinsic material properties (*c*,  $\rho$ , *d*, *k*, and *A*) are known beforehand. Based on this understanding, we aim to control the internal temperature dynamics by adjusting these material and device parameters.

Since temperature rise and heat dissipation normally occur very fast (e.g., nanoseconds), it is generally desirable to slow down the temperature dynamics to more efficiently take advantage of the 2nd-order effects. For example, if the devices are used to directly process biological spike trains, a time constant on the order of ms is desired. To achieve a longer time constant, Equation (13) suggests that heat conduction paths with smaller A and lower k are required. Experimentally, we inserted heat insulating (HI) layers (NiCr), whose thermal conductivity value is about one-fourth of the top and bottom electrode (Pd), between  $T_1$  and  $T_2$  on both sides (Figure 1b) to test k dependency. We also tested devices with different sizes, from 1, 4, 20, and 40 to 100  $\mu$ m<sup>2</sup>, to verify A dependency. Figure 1b,c shows the schematic cross section view and the top view of such a thermally enhanced device having HI layers in the extended top and bottom electrodes, respectively.



A detailed device model<sup>[15]</sup> based on the COMSOL Multiphysics tool was used to calculate the temperature profile and the oxygen vacancy density at various stages during device switching. Figure 1d shows the Vo profile of the devices immediately after the forming process, with the upper panel showing the reference device without the HI layer (standard device), and the lower panel showing the device with the inserted HI layers (HI-device). The two cases show different Vo distribution profiles that lead to different conductance values. Figure 1e plots the internal temperature distribution immediately after filament formation for the two cases. Thanks to the HI layers, the proposed HI-device can maintain the induced heat better, as shown in the lower panel. The enhanced thermally assisted ion diffusion/drift in turn results in the wider Vo filament in the switching layer of the HI-device, compared with the standard device (Figure 1d).

Figure 1f shows the electrical measurement results for a standard device and a HI-device, respectively. Compared with the standard device, the HI-device can be switched at a lower voltage and results in a higher on-current, which can be explained by its better heat accumulation capability as supported by the simulated temperature profiles and V<sub>O</sub> distribution profiles discussed above (Figure 1d,e). Similarly, better heat trapping also allows the HI-device to reset at a lower voltage, due to its elevated mobility at the elevated internal temperature, as explained by Equation (9). These experimental RS characteristics are in turn reproduced and supported by the Multiphysics simulation results (Figure 1g), for the same processes and device structures.

The Ta<sub>2</sub>O<sub>5</sub>-based memristors also exhibit analogue conductance modulations (Figure 1h,i), when stimulated with consecutive potentiation or depression pulses. In the measurements, 100 depression pulses were applied first to induce long-term depression (LTD), followed by 100 potentiation pulses to induce long-term potentiation (LTP). Due to better heat accumulation, the HI-device (Figure 1i) shows a wider conductance modulation range, even though pulses with lower amplitude (0.6 V/0.85 V) were applied during the LTP/LTD processes.

Temporal properties of these 2nd order memristors are also verified both computationally and experimentally. Simple, nonoverlapping square pulses were used in these tests, as depicted in Figure 3a. Pre- and post- synaptic pulses were applied to the top and bottom electrode, respectively. Each pulse is designed to have a low amplitude that by itself is not enough to evoke conductance change. As discussed earlier, the time gap between pulses ( $\Delta t$ ) plays a critical role in determining the internal temperature and allows the device to naturally decode input timing information. Figure 3b shows the simulated internal device temperature induced by two pulses with different  $\Delta t$ . As expected, smaller  $\Delta t$  leads to higher internal temperature because the residual heat from the 1st pulse is largely accumulated during the 2nd pulse. To characterize the temporal dynamics of the heat decay process, we define the time window as the time gap between the two pulses in a pulse pair within which a measurable conductance change (i.e.,  $1 \mu S$ ) can be induced. When  $\Delta t$  is within the heat decay's time window the sufficiently elevated temperature experienced at the 2nd pulse can induce a conductance change (Figure 3c). In particular, the polarity and amplitude of the conductance change depend on







**Figure 3.** a) Schematic of the pre- and post-synaptic pulse pairs with a time gap ( $\Delta t$ ) for STDP studies. b) Simulated internal temperature dynamics induced by pulse pairs, for two different  $\Delta t$  cases, where the maximum temperature inside the device was recorded as a function of time. c) Simulated (blue and red dots) and measured (green dots) STDP behaviors natively implemented in the standard device with device area 100  $\mu$ m<sup>2</sup>. d) Long term conductance changes induced by sequentially applying pulse pairs, for two scenarios with  $\Delta t$  of 100 ns and 10  $\mu$ s, respectively.

the sign and value of  $\Delta t$ , leading to the natively implemented spike-timing dependent plasticity rules, as shown in Figure 3c. The experimental data are consistent with simulation results based on the Multiphysics device model, further supporting the roles of the internal dynamics in native STDP implementation.

More results can be obtained by repeatedly applying the STDP pulse pairs and observe the device response. The first half of Figure 3d shows the device response to 80 pulse pairs with a preceding post-synaptic pulse followed by a pre-synaptic pulse, while the 2nd half showed subsequent response to 80 pulse pairs with a preceding pre-synaptic pulse followed by a post-synaptic pulse. The measurements were also performed for two  $\Delta t$  conditions. While pulse pairs with a 100 ns time gap causes clear conductance modulations, the 10 µs time gap pulse pairs cannot evoke conductance modulation even after repeated applications. These measurements again illustrate the importance of the short-term dynamics of the internal temperature to achieve STDP characteristics.

The device model further suggests how to tune the internal short-term heat dynamics. For example, based on Equation (13), the heat dynamics depend on material properties such as the heat capacity *c*, mass density  $\rho$ , thermal conductivity *k*, and device geometry *A* and *d*. In the following, we discuss experimental and simulation results by tuning these parameters to control the 2nd order memristor's time constant and the STDP behavior.

First, a larger device area facilitates faster cooling, as evidenced in Equation (13). **Figure** 4a shows the simulated thermal distribution inside memristors having different electrode areas of 1 and 100  $\mu$ m<sup>2</sup>, respectively. The results are captured at the beginning of the forming process. One can see that the peak temperature, which drives the ion migration, is significantly lower in the larger area device. This effect may be explained by the fact that the electrodes act as heat sinks to dissipate heat produced by Joule heating, so devices with wider electrodes will have much faster heat dissipation than devices with narrower

ones, thanks to its larger thermal conductance  $\left(\frac{kA}{d}\right)$ .<sup>[23]</sup>

STDP characteristics measured from standard devices with different areas support these simulation results (Figure 4b). In

the measurement, each conductance change value is calculated after the application of 100 pulse pairs, comprised of a presynaptic pulse of 0.9 V and a post-synaptic pulse of 0.8 V with 100 ns pulse width, at a given  $\Delta t$ , and the results are averaged from 3 such measurements to reduce variations. The device is then reset to the same initial condition for the next measurement.  $t_1$  and  $t_{100}$  represent time windows of the 1 and 100 µm<sup>2</sup> device, respectively. As expected, the device with a smaller electrode area (1 µm<sup>2</sup>) can afford a longer  $\Delta t$  (time window) for STDP owing to the slower cooling effect, while the device with a larger electrode area (100 µm<sup>2</sup>) requires a much shorter time window. Figure 4c plots the dependency of the required time window on the device area: the experimentally measured time window is inversely proportional to the device area, agreeing with the analytical expression in Equation (13).

Besides the time window  $\Delta t$  required for minimum conductance change, the characteristic time constant  $\tau$  in the STDP response can also be obtained by fitting the LTP and LTD portions following:

$$\Delta G = B_{\rm G} \cdot \exp\left(-\frac{|\Delta t|}{\tau}\right) \tag{14}$$

The STDP time constants are extracted from devices with different areas presented in Figure 4b and are plotted in Figure 4d, showing the STDP time constant also decreases with increasing device area. This observation can also be explained by the device model. Specifically, based on Equations (13) and (14),  $\tau$  can be derived as:

$$\tau = \frac{c\rho d}{kA} \cdot \frac{\ln\left(\frac{T_1(t) - T_2}{T_1(0) - T_2}\right)}{\ln\left(\frac{\Delta G}{B_G}\right)}$$
(15)

showing inverse dependence of *A*.

Figure 4e,f shows the dependency of STDP characteristics on pulse amplitudes for the two cases. The pulse amplitude

SCIENCE NEWS \_\_\_\_\_ www.advancedsciencenews.com

DVANCED





**Figure 4.** a) Internal temperature profiles of two standard devices with areas of 1 and 100  $\mu$ m<sup>2</sup>, respectively. The temperature profiles were captured at the beginning of the filament formation. b) STDP behaviors experimentally measured from two fabricated devices with areas of 1 and 100  $\mu$ m<sup>2</sup>, respectively. c) The extracted STDP time window as a function of device area. The solid line is a rational function fitting following Equation 13. d) The extracted STDP time constant as a function of the device area, along with a linear fit. STDP characteristics obtained with different pulse amplitudes for the two devices with areas of e) 1 and f) 100  $\mu$ m<sup>2</sup>, respectively.

has a strong effect on the size of the conductance change, but a weak effect on the STDP time constant. This observation can again be explained by the heat dynamics (i.e., Equations (13) and (15)), where the internal short-term dynamics is only determined by material and device parameters where the applied pulse amplitude only has an indirect effect. On the other hand, tuning the pulse amplitude may be useful if a larger or smaller STDP effect is desired without changing its temporal characteristics.

We next study the 2nd-order memristor's dependency on the thermal conductivity in the thermal conducting path. As introduced in Figure 1b,c, an additional low thermal conductivity layer (HI layer) is added between the anticipated high temperature source point ( $T_1$  in Figure 1b) and the cooling point ( $T_2$ in Figure 1b) to slow down the cooling process. The effect of the HI laver is first simulated, as shown in Figure 2a (with HI) and 2b (without HI). Each frame in the figures corresponds to a time instant after the application of a programming pulse, and the plots show the internal temperature distribution evolution for the two cases. As suggested by Equations (13) and (15), HI helps the device to reach a higher internal temperature, and traps heat longer than the standard device without the HI layer. Figure 2c plots the evolutions of the peak internal temperature for the two cases, highlighting the effects of inserting the HI layer in increasing the peak temperature and slowing down the heat dissipation process.

These effects were then tested in experimentally fabricated devices with the HI layer using post-synaptic pulse of 0.7 V and pre-synaptic pulse of 0.9 V with 100 ns pulse width. Specifically, Figure 2d plots the STDP results obtained from a HI-device and a standard device, both having the same area of  $1 \,\mu m^2$ . While the standard device exhibits an STDP time window shorter than 1 µs, the HI-device exhibits a much longer time window of  $\approx 1$  ms. The time constant of the  $1 \,\mu m^2$  standard device and HI-device shown in Figure 2d are ( $\tau_d$ : 808.4 ns,  $\tau_p$ : 728.9 ns) and ( $\tau_d$ : 496.0 µs,  $\tau_p$ : 353.2 µs), respectively. The ability to slow down the heat dissipation and increase the STDP time window to ms level may open opportunities for the 2ndorder memristor devices and networks to directly interact with temporal signals at these time scales, that is, neural recordings from biological networks.<sup>[24]</sup> Similar to the standard device, the time constant of the HI-device can also be adjusted by tuning the device area (*A*<sub>1</sub>), as shown in Figure 2e, with time constants of ( $\tau_d$ : 496.0 µs,  $\tau_p$ : 353.2 µs) and ( $\tau_d$ : 465.3 µs,  $\tau_p$ : 324.7 µs), for the 1 and 4  $\mu$ m<sup>2</sup> devices, respectively. The fitting parameters for Equation (14) are summarized in Table 2.

Finally, we verify the feasibility to perform efficient temporal computation with a 2nd-order memristor network through the natively implemented of STDP learning rule with device-fitted simulation. Spiking neural networks have attracted increasing interest in recent years, as they offer an appealing opportunity for highly efficient computation with sparse, binary spikes. It

# ADVANCED SCIENCE NEWS \_\_\_\_\_

#### Table 2. Fitting parameters for the measurement data of devices.



| Device          | Layer stack                                                                                                       | Cross-sectional<br>area | Time constant of LTP ( $\tau_p$ ) | Time constant of LTD $(	au_{d})$ | Fitting parameter<br>of LTP, B <sub>Gp</sub> | Fitting parameter<br>of LTD, B <sub>Gd</sub> |
|-----------------|-------------------------------------------------------------------------------------------------------------------|-------------------------|-----------------------------------|----------------------------------|----------------------------------------------|----------------------------------------------|
| Standard device | Pd (35 nm)-<br>TaO <sub>x</sub> (30 nm)-Ta <sub>2</sub> O <sub>5</sub> (4 nm)-Pd (30 nm)                          | 1 μm²                   | 728.9 ns                          | 808.4 ns                         | 0.00282                                      | -0.00286                                     |
|                 |                                                                                                                   | 100 μm²                 | 58.6 ns                           | 68.8 ns                          | 0.04373                                      | -0.01120                                     |
| HI-device       | NiCr (40 nm)-Pd (35 nm)-TaO <sub>x</sub> (30 nm)-Ta <sub>2</sub> O <sub>5</sub><br>(4 nm)-Pd (60 nm)-NiCr (70 nm) | $1\mu m^2$              | 353.2 μs                          | 496.0 μs                         | 0.00243                                      | -0.00208                                     |
|                 |                                                                                                                   | $4\mu m^2$              | 324.7 μs                          | 465.3 μs                         | 0.00156                                      | -0.00141                                     |

has been shown that STDP can help learn the input correlations by exploiting the timing information embedded in the spiking sequences,<sup>[21]</sup> and STDP memristor-based networks

have been used to perform synaptic connection pattern reconstruction,<sup>[24]</sup> coincidence detection,<sup>[25]</sup> and correlation pattern detection.<sup>[26,27]</sup> However, in general these prior studies do not



**Figure 5.** a) Scheme showing the correlation detection setup. The network consists of a single post-synaptic LIF neuron and an array of 2nd-order memristor synapses. b) Ground truth covariance matrix between the 100 input spike trains with fixed firing rate at 500 Hz. The correlation within group 1 (neuron 0–9) is around 0.1, and the correlation within group 2 (neuron 10–19) is around 0.2. The other 80 spike trains are uncorrelated. c) Raster plot of the 100 pre-synaptic neurons (grey) and the post-synaptic neuron (magenta), showing the neurons' firing patterns. d) Histogram and e) line plot of the normalized device conductance evolution upon application of the input spike trains, showing the clear separation of device conductance values that form groups corresponding to different input correlation coefficients. The synaptic devices receiving inputs with the highest correlation (10–19) evolve to form a group having the highest conductance values, while the synaptic devices receiving uncorrelated inputs (20–99) experience no noticeable conductance changes. Histogram of the normalized final device conductance with different levels of D2D variations of f)  $A_p$  and g)  $\tau_p$ . The D2D variations are assumed to be Gaussian in the studies, and the percentage values represent the standard deviation of the D2D variations normalized against the mean value.





In this study, the inputs are 100 spike trains each consisting of around 1000 spiking events, where the different input streams have different degrees of correlation (see Experimental Section for more details). As shown in Figure 5b, each presynaptic input stream is fed into one memristor device, whose normalized conductance value represents the synaptic efficacy. The summation of the weighted post-synaptic potential then updates the membrane potential  $V_m$  of the post-synaptic neuron based on the LIF neuron model. The post-synaptic neuron fires an action potential if Vmm surpasses the defined threshold value  $V_{\rm th}$  (Figure 5a), and  $V_{\rm m}$  is reset to the resting state after firing. During this process, the 2nd-order memristive devices naturally adapt their conductance states based on the relative timing between the pre- and post-synaptic spikes. Compared with random inputs, correlated inputs have a higher probability to precede a post-synaptic spike, as the arrival times have higher chance to coincide with each other to cause a larger increase in V<sub>m</sub>.<sup>[21]</sup> The pre-post spike pairs then lead to potentiation of the corresponding synapses, which further increases the tendency to induce post-synaptic events. After this process, the correlated inputs can be identified from the uncorrelated ones, reflected by the clear separation of device conductance into several groups based on the degree of input correlation, as shown in Figure 5d,e. Inputs with higher correlation induce faster learning and higher memristor conductance values, while uncorrelated inputs do not induce significant conductance changes, as the LTP and LTD effects essentially cancel out each other, resulting in conductance distributions that match the correlation pattern in the inputs.

Beyond the assumption that all devices are identical, we performed additional studies to evaluate how device-to-device (D2D) variations would impact the system performance. D2D variations drawn from a Gaussian distribution have been added to the amplitude  $(A_p)$  and relaxation time  $(\tau_p)$  of the STDP behavior. Standard deviations from 5% to 15% have been simulated, which is comparable to the experimental measurement results in actual devices. Figure 5f shows that with increasing variations in  $A_p$ , the separation between groups of different correlation coefficients start to shrink. On the other hand, even with 15% percentage of standard variation, no overlapping between groups is observed. We also studied the influence of D2D variations of relaxation time. Figure 5g shows that the final histograms with different levels of  $\tau_{\rm p}$  variations are almost identical, meaning that the system is insensitive to the variation of relaxation time. Therefore, the purposed system exhibits robustness to D2D variations.

# 4. Conclusion

We demonstrate the ability to control the STDP characteristics, including the characteristic time constants of a 2nd order



memristor, by controlling the device's material parameters and structural dimensions. In contrast, different pulse amplitudes modify the size of the conductance changes without changing the characteristic time window. The ability to slow down the heat dissipation processes and extend the STDP time window allows the 2nd-order memristors to potentially be tailored to process temporal signals with specific temporal characteristics, such as neural recordings from biological networks. When integrated with a post-synaptic artificial neuron, the 2nd order memristor networks can capture the correlation in the pre-synaptic spikes in an unsupervised fashion, leading to inhomogeneous conductance distributions that match the correlation pattern of the input data.

# Acknowledgements

The authors would like to thank Dr. S. H. Lee and Dr. M. Zidan for helpful and stimulating discussions. This work was supported in part by the National Science Foundation through awards ECCS-1915550 and DMR-1810119.

# **Conflict of Interest**

The authors declare no conflict of interest.

# **Data Availability Statement**

The data that support the findings of this study are available from the corresponding author upon reasonable request.

# **Keywords**

2nd order memristors, correlation detection, neuromorphic computing, spike-timing dependent plasticity, spiking neural network (SNN)

Received: September 20, 2021 Revised: January 19, 2022 Published online: February 17, 2022

- International Technology Roadmap for Semiconductors, Editioin 2013, Executive Summary, 2013, http://www.itrs2.net/itrs-reports. html.
- [2] F. Cai, J. M. Correll, S. H. Lee, Y. Lim, V. Bothra, Z. Zhang, M. P. Flynn, W. D. Lu, Nat. Electron. 2019, 2, 290.
- [3] S. H. Jo, T. Chang, I. Ebong, B. B. Bhadviya, P. Mazumder, W. Lu, Nano Lett. 2010, 10, 1297.
- [4] A. Sebastian, M. L. Gallo, R. Khaddam-Aljameh, E. Eleftheriou, Nat. Nanotechnol. 2020, 15, 529.
- [5] C. Du, W. Ma, T. Chang, P. Sheridan, W. D. Lu, Adv. Funct. Mater. 2015, 25, 4290.
- [6] Y. J. Jeong, S. Kim, W. D. Lu, Appl. Phys. Lett. 2015, 107, 173105.
- [7] S. Kim, S. J. Kim, K. M. Kim, S. R. Lee, M. Chang, E. Cho, Y. B. Kim, C. J. Kim, U.-In Chung, I. K. Yoo, *Sci. Rep.* **2013**, *3*, 1680.
- [8] S. Kim, C. Du, P. Sheridan, W. Ma, S. Choi, W. D. Lu, Nano Lett. 2015, 15, 2203.
- [9] S. Yang, Y. Tang, R. S. Zucker, J. Neurophysiol. 1999, 81, 781.
- [10] M. A. Zidan, Y. J. Jeong, W. D. Lu, IEEE Trans. Nanotechnol. 2017, 16, 721.
- [11] S. Yu, S. Member, Y. Wu, R. Jeyasingh, D. Kuzum, H. P. Wong, IEEE Trans. Electron Devices 2011, 58, 2729.

#### **ADVANCED** SCIENCE NEWS

www.advancedsciencenews.com



- [12] T. Serrano-Gotarredona, T. Masquelier, T. Prodromakis, G. Indiveri, B. Linares-Barranco, *Front. Neurosci.* 2013, 7, 2.
- [13] Y. Guo, H. Wu, B. Gao, H. Qian, Front. Neurosci. 2019, 13, 812.
- [14] P. Diehl, M. Cook, Front. Comput. Neurosci. 2015, 9, 99.
- [15] S. H. Lee, J. Moon, Y. J. Jeong, J. Lee, X. Li, H. Wu, W. D. Lu, ACS Appl. Electron. Mater. 2020, 2, 701.
- [16] K. G. Schmidt, *Heat ATLAS*, Springer-Verlag, Berlin Heidelberg 2010.
- [17] J. R. Rumble, CRC Handbook of Chemistry and Physics, 101st ed., CRC Press, Boca Raton, Florida 2020.
- [18] M. Spittel, T. spittel, Part3: Non-ferrous Alloys Heavy metals, in Group VIII Advanced Materials and Technologies (Ed: H. Warlimont), Springer-Verlag, Berlin Heidelberg 2016.
- [19] X. Zhang, H. Xie, M. Fuji, H. Ago, K. Takahashi, T. Ikuta, H. Abe, T. Shimizu, *Appl. Phys. Lett.* **2005**, *86*, 171912.

- [20] J. Lee, W. Schell, X. Zhu, E. Kioupakis, W. D. Lu, ACS Appl. Mater. Interfaces 2019, 11, 11579.
- [21] R. Gütig, R. Aharonov, S. Rotter, H. Sompolinsky, J. Neurosci. 2003, 23, 3697.
- [22] S. Kim, S. Choi, W. Lu, ACS Nano 2014, 8, 2369.
- [23] L. Boteler, A. Lelis, M. Berman, M. Fish, 2019 IEEE 7th Workshop on Wide Bandgap Power Devices and Applications, IEEE, Piscataway 2019, p. 265.
- [24] Y. Wu, J. Moon, X. Zhu, W. D. Lu, Adv. Intell. Syst. 2021, 3, 2000276.
- [25] M. Prezioso, M. R. Mahmoodi, F. M. Bayat, H. Nili, H. Kim, A. Vincent, D. B. Strukov, *Nat. Commun.* 2018, 9, 5311.
- [26] T. Tuma, M. L. Gallo, A. Sebastian, S. Member, *IEEE Electron Device Lett.* 2016, *37*, 1238.
- [27] A. Sebastian, T. Tuma, N. Papandreou, M. L.e Gallo, L. Kull, T. Parnell, E. Eleftheriou, *Nat. Commun.* 2017, 8, 1115.