• Journal of Semiconductors
  • Vol. 46, Issue 3, 032201 (2025)
Xinyu Shen1,2, Zhao Zhang1,2,*, Jie Yang3, Jian Liu1,2..., Nanjian Wu1,2, Mohamad Sawan3 and Liyuan Liu1,2|Show fewer author(s)
Author Affiliations
  • 1State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100049, China
  • 2Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
  • 3CenBRAIN Neurotech, School of Engineering, Westlake University, Hangzhou 310024, China
  • show less
    DOI: 10.1088/1674-4926/24100022 Cite this Article
    Xinyu Shen, Zhao Zhang, Jie Yang, Jian Liu, Nanjian Wu, Mohamad Sawan, Liyuan Liu. A 0.0012-mm2 0.66-pJ/bit BPSK demodulator incorporating a loop-filter-less PLL achieving the maximum data rate of fcarrier/2[J]. Journal of Semiconductors, 2025, 46(3): 032201 Copy Citation Text show less

    Abstract

    This paper presents a compact ultra-low-power phase-locked loop (PLL) based binary phase-shift keying (BPSK) demodulator. The loop-filter-less (LPF-less) PLL is proposed to make phase of PLL output carrier signal track the phase of BPSK signal in real time. Thus, the maximum date rate can be significantly extended to the half of the carrier frequency (fcarrier) with a very compact size compared to prior PLL-based BPSK demodulators. Furthermore, eliminating all the static power in our LPF-less PLL, the energy efficiency is obviously improved. Fabricated in a 40-nm CMOS process, our prototype occupies 0.0012-mm2 core active area, and achieves the maximum data rate of 6.78 Mb/s (fcarrier/2) at fcarrier of 13.56 MHz. The power consumption and energy efficiency is 4.47 μW and 0.66 pJ/bit at 6.78-Mb/s data rate, respectively.

    Introduction

    Wireless implanted integrated circuits (IC) are becoming attractive in biomedical applications recently[1, 2]. The wireless power and data transfer telemetry (WPDT) sub-system is essential for the implanted ICs to transfer power and control data from the external transmitter, as shown in Fig. 1. Binary-phase-shift keying (BPSK) is particularly well-suited for the downlink data communication of wireless power and data transfer (WPDT) systems due to its high conversion efficiency, compatibility with narrowband inductive links, and robust immunity to amplitude noise. While amplitude-shift keying (ASK) and frequency-shift keying (FSK) are commonly employed modulation schemes, each with distinct data encoding methods through modulation of amplitude, frequency, they exhibit specific limitations. ASK, for instance, though straightforward and easily demodulated, is highly sensitive to variations in coil distance and external magnetic interference. ON−OFF keying (OOK) enhances ASK’s immunity by achieving a high modulation index (e.g., 100%)[3]. However, this improvement comes at the expense of power transmission efficiency due to carrier discontinuities during ON intervals. FSK modulations[4], while resilient to background interference, impose significant spectrum demands that constrain data rates, particularly in medical implantable devices where low-frequency carriers. Hence, designing a BPSK demodulator with small area, ultra-low power and sufficient data rate is significant[5].

    (Color online) Simplified block diagram of the WPDT system.

    Figure 1.(Color online) Simplified block diagram of the WPDT system.

    Recently, several BPSK demodulators have been reported, which can be categorized into the PLL-based[68] and PLL-less approaches[911]. The PLL-based approach has been demonstrated to be suitable for the inductive link with high-Q coils[6]. Yet, it is power hungry mainly due to the static power of voltage-controlled oscillator (VCO) and charge pump (CP) in the PLL; and the loop bandwidth is limited for the stability consideration, thus requiring a large integral capacitor CINT[12] with slow data rate. In contrast, using the edge-detection method without oscillator, CP and large capacitor, the PLL-less BPSK demodulator achieves much higher data rate with lower power and smaller area. However, it is not suitable for the use of high-Q coils because the phase of the received BPSK signal on the secondary coil needs a few cycles to invert its phase when data transition occurs, thus neutralizing the edge-detection scheme[6, 7].

    In this paper, we propose a PLL-based BPSK demodulator, which not only supports high-Q coils but also achieves comparable high data rate and compact size compared to prior PLL-less approaches. The loop-filter-less (LPF-less) PLL is proposed to make the PLL output phase track the input phase in real time by cutting the LPF, hence significantly increasing the data rate and reducing area concurrently. Meanwhile, the energy efficiency is also obviously improved thanks to the free of the static power in our LPF-less PLL.

    PLL-based BPSK demodulator

    Fig. 2 depicts the block diagram of the proposed PLL-based BPSK demodulator. It consists of our devised LPF-less PLL, a timing starter (TS), an edge alignment calibrator (EAC), a trigger detector (TD) and a clock and data recovery block (CDR). Thanks to the removal of the LPF, the PLL output PLLOUT can track the phase of its input BPSKIN in real time. So, the data rate is improved with significantly shrunk area. Meanwhile, power reduction is realized due to the free from the PLL static power, as detailed in Section 2.1. The EAC along with the TS aligns the rising edge of PLLOUT and BPSKIN, and generates two edge-aligned signals: PLLDL and BPSKSYN. Since there is a delay between the PLLDL and BPSKSYN, a phase difference step between the two signals is generated when input data transition occurs. Thus, TD can detect such phase difference to generate a trigger signal TR for the CDR to recover the modulated data DOUT with its synchronized clock CKOUT. DRMAX is used to select if the demodulator operates at its maximum data rate (fcarrier/2) or not, and the ratio between data rate and fcarrier is controlled by DRSEL. The details of the operation process, TS, EAC, TD, and CDR are presented in Section 2.2.

    (Color online) Block diagram of the proposed BPSK demodulator.

    Figure 2.(Color online) Block diagram of the proposed BPSK demodulator.

    Proposed loop-filter-less PLL

    To solve the issues of the widely used CP-based PLL described in Section 1, we propose the XOR-PLL by simply removing the LPF of prior XOR-gate-based PLL[13], as illustrated in Fig. 3. Without the LPF, the VCO in prior PLL becomes the pulse-driven oscillator (PDO), namely its node voltages at N1, N2, and N3 switch only when a low-level voltage pulse of VC occurs. Thus, the static power of the oscillator as well as the whole PLL is omitted. This enables obvious size shrinkage and energy efficiency improvement concurrently.

    (Color online) Schematics of (a) prior XOR-PLL, and (b) proposed LPF-Less PLL.

    Figure 3.(Color online) Schematics of (a) prior XOR-PLL, and (b) proposed LPF-Less PLL.

    The operation principle of the LPF-Less PLL is explained based on the simulated timing diagram depicted in Fig. 4 (fcarrier: 13.56 MHz). When BPSKIN switches from 0 to 1, a VC pulse generated, which makes IC to be VDD and causes N2 goes to 1; then, N3 goes to 0 to the trigger rising edges of PLLOUT and N1; after that, VC goes to 1, N1 and N3 hold their voltage values till next transition of BPSKIN; N2 goes down slowly due to the leakage caused by nonzero voltage of N1. The similar process is shown in Fig. 4 when BPSKIN switches from 1 to 0. Note that it is essential to hold the voltages of N1N3 when VC is 1, so, the thick-oxide MOSFETs are used in the PDO to minimize the leakage. The PLL input phase offset can be compensated by the EAC and TS, as described in Section 2.2.

    (Color online) Simulated timing of the LPF-Less PLL.

    Figure 4.(Color online) Simulated timing of the LPF-Less PLL.

    As explained before, the edge of PLLOUT is trigged by the VC pulse, which is generated by the transition of BPSKIN. This indicates that PLLOUT can track BPSKIN in real time when a input phase transition, as demonstrated by the simulation results in Fig. 5. Hence, the data rate can be significantly improved compared to prior PLL-based BPSK modulators.

    (Color online) Simulated timing of the LPF-Less PLL with input phase transition.

    Figure 5.(Color online) Simulated timing of the LPF-Less PLL with input phase transition.

    Operation process

    The schematics of TS, EAC, TD, and CDR are depicted in Figs. 6(a)−6(d), respectively, and the detailed operation process are explained in Fig. 7, including two parts: power-on calibration and normal operation.

    Schematics of (a) TS, (b) EAC, (c) TD, and (d) CDR.

    Figure 6.Schematics of (a) TS, (b) EAC, (c) TD, and (d) CDR.

    (Color online) Operation process: (a) DRMAX = 0, (b) DRMAX = 1.

    Figure 7.(Color online) Operation process: (a) DRMAX = 0, (b) DRMAX = 1.

    Initially, the power-on signal POR goes from 0 to 1 to reset TS, EAC, TD, and CDR. After that, the TS block resets the initial phase of BPSK signal BPSKSYN to lag the PLL signal PLLSYN, and the delay of the digitally controlled delay line (DCDL, in Fig. 6(b)) in the EAC is reset to its minimal value with HOLD = 0; then the phase alignment calibration begins, and the counter starts to increase the DCDL delay step by step until BPSKSYN begins to slightly lead PLLDL (roughly aligned with each other). At that time, HOLD goes to 1 to hold the state of counter and phase/frequency detector (PFD)[14], indicating the end of the power-on calibration.

    After power-on calibration, our demodulator starts to operate normally. Without input data transition, TR keeps 0, as indicated in Fig. 6(c) and Fig. 7. When input data transition occurs, TR goes to 1. The rising edge of the TR is used to get DOUT and CKOUT through the CDR block[7]. The main difference between DRMAX = 0 and DRMAX = 1 is TR signal, as compared in Figs. 7(a) and 7(b). When DRMAX = 0 (see Fig. 7(a)), the delay between the data transition and the TR falling edge τTR is 2Tcarrier (PLL output period). This is OK for the date rate ≤(fcarrier/4), but such TR cannot distinguish consecutive data transition at the data rate of (fcarrier/2). In the contrast, if DRMAX = 1, TR pulse width can be reduced to t2, as indicated in Fig. 6(c). Thus, τTR can be <2Tcarrier by properly setting t2 so that the modulated data (DATAMOD) can be correctly demodulated (see DDEMOD and DOUT) at the maximum data rate of fcarrier/2, as illustrated in Fig. 7(b).

    This demodulator can still operate at the data rate below fcarrier/2 if DRMAX = 1, as indicated in Fig. 7(b). However, such mode cannot support the use of high-Q coils because the received input of the demodulator requires several cycles to invert its phase when data transition occurs. Hence, the phase difference between BPSKSYN and PLLDL lasts more than one carrier period. This means that one input data transition may generate more than one TR pulses when DRMAX = 1, thus leading to wrong demodulated output, as demonstrated by the simulation results shown in Fig. 8(a). Alternately, if DRMAX = 0, each data transition still generate one TR pulse, thus maintaining the correct function, as illustrated in Fig. 8(b).

    (Color online) Simulation at the data rate of (fcarrier/8) with high-Q coils: (a) DRMAX = 1, (b) DRMAX = 0.

    Figure 8.(Color online) Simulation at the data rate of (fcarrier/8) with high-Q coils: (a) DRMAX = 1, (b) DRMAX = 0.

    In summary, our proposed BPSK demodulator can operate at a high data rate of fcarrier/2 with low-Q coils by selecting DRMAX to be 1, and can also support high-Q coils at a lower data rate with DRMAX = 0.

    Measurement

    Fig. 9 exhibits our 40-nm CMOS prototype. The core area of our BPSK demodulator is only 0.0012 mm2, of which only 0.000123 mm2 (8.5 μm × 14.5μm) is occupied by the LPF-Less PLL. It is supplied by a 0.6-V supply voltage and operates at 13.56-MHz fcarrier. An on-chip BPSK modulator with a PRBS-7 generator was also implemented for function testing. Fig. 10 demonstrates the operation at its maximum data rate of 6.78 Mb/s (fcarrier/2). The dc power excluding output buffer for testing is 4.47 μW, corresponding to an energy efficiency of 0.66 pJ/bit.

    (Color online) Chip photograph with the core layout of the BPSK demodulator.

    Figure 9.(Color online) Chip photograph with the core layout of the BPSK demodulator.

    ParametersThis workAPCCAS'22[7]OJCAS'21[8]JSSC'18[6]JSSC'15[9]SSCL'20[10]ISCAS'13[11]BioCAS'14[16]
    TypeLPF-Less-PLLBasedPLL-BasedPLL-BasedPLL-BasedPLL-BasedPLL-LessPLL-LessPLL-Less
    Process (nm)4040180180130180180180
    High-Q telemetryYESYESYESYESYESNONONO
    Data rate (Mb/s)6.780.84752.260.2111.312513.56110
    fcarrier (MHz)13.5613.5613.5613.562113.56210
    (Data rate)/fcarrier1/21/161/61/641/1611/21
    Power (μW)4.4731.654817.4217200012.28277.9
    Energy efficiency(pJ/bit)0.6637.353621028.441523.80.9827.79
    Supply voltage (V)0.60.71.82.01.20.91.81.8
    Core area (mm2)0.00120.0040.150.137NA0.00380.0350.0035

    Table 1. Performance summary and comparison.

    (Color online) Measurement results at the maximum data rate 6.78 Mb/s.

    Figure 10.(Color online) Measurement results at the maximum data rate 6.78 Mb/s.

    We also measured the performance with high-Q coins (L: 2.6 μH, Q: 68.4)[15] at the demodulator’s input, as the test setup shown in Fig. 11(a). The modulated BPSK signal is generated by an arbitrary function generator and a transmitter (TX) board, and transmitted to the receiver (RX) frond-end board through two coupled high-Q coils for our BPSK demodulator. Fig. 11(b) shows the measured results at the data rate of 1.695 Mb/s (fcarrier/2), demonstrating the correct function with high-Q coils.

    (Color online) Measurement with high-Q coils: (a) setup, (b) results at 1.695 Mb/s.

    Figure 11.(Color online) Measurement with high-Q coils: (a) setup, (b) results at 1.695 Mb/s.

    Table 1 compare the performance of this work with other recently published BPSK demodulators. Compared to other PLL-based approaches, the core area of our work is at least 3.3 times smaller, and the data rate is at least 8 times higher with the best energy efficiency. When compared to other PLL-less BPSK demodulators, the area and energy efficiency of this work are also among the best reported to date with a comparable data rate, and this work supports the operation with high-Q coils.

    Conclusions

    This paper presents a PLL-based BPSK demodulator and demonstrated in a 40-nm CMOS process. Thanks to our proposed LPF-Less PLL, the core area is significantly shrunk with the concurrent improvement of the data rate and energy efficiency.

    References

    [1] S H Wang, Y K Huang, C Y Chen et al. Design of a bone-guided cochlear implant microsystem with monopolar biphasic multiple stimulations and evoked compound action potential acquisition and its in vivo verification. IEEE J Solid State Circuits, 56, 3062(2021).

    [2] X H Qian, Y C Lee, J H Chang et al. Design and in vivo verification of a CMOS bone-guided cochlear implant microsystem. IEEE Trans Biomed Eng, 66, 3156(2019).

    [3] Y Y Jia, U Guler, Y P Lai et al. 26.8 A trimodal wireless implantable neural interface system-on-chip. 2020 IEEE International Solid-State Circuits Conference-(ISSCC), 414(2020).

    [4] Y Park, S T Koh, J Lee et al. 33.7 A frequency-splitting-based wireless power and data transfer IC for neural prostheses with simultaneous 115mW power and 2.5Mb/s forward data delivery. 2021 IEEE International Solid-State Circuits Conference (ISSCC), 472(2021).

    [5] Y M Hu, M Sawan. A fully integrated low-power BPSK demodulator for implantable medical devices. IEEE Trans Circuits Syst I Regul Pap, 52, 2552(2005).

    [6] C H Cheng, P Y Tsai, T Y Yang et al. A fully integrated 16-channel closed-loop neural-prosthetic CMOS SoC with wireless power and bidirectional data telemetry for real-time efficient human epileptic seizure control. IEEE J Solid State Circuits, 53, 3314(2018).

    [7] Y X Li, X Y Shen, Z Y Zhang et al. A 0.004-mm2 0.7-V 31.654-μW BPSK demodulator incorporating dual-path loop self-biased PLL. 2022 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS), 569(2022).

    [8] L Pan, M Y Chen, Y F Chen et al. An energy-autonomous power-and-data telemetry circuit with digital-assisted-PLL-based BPSK demodulator for implantable flexible electronics applications. IEEE Open J Circuits Syst, 2, 721(2021).

    [9] H Cho, H Lee, J Bae et al. A 5.2 mW IEEE 802.15. 6 HBC standard compatible transceiver with power efficient delay-locked-loop based BPSK demodulator. IEEE J Solid State Circuits, 50, 2549(2015).

    [10] C Y Lo, H C Hong. A 0.9 pJ/b, reference clock free, delay-based, all-digital coherent BPSK demodulator. IEEE Solid State Circuits Lett, 3, 498.

    [11] B P Wilkerson, J K Kang. A low power BPSK demodulator for wireless implantable biomedical devices. 2013 IEEE International Symposium on Circuits and Systems (ISCAS), 626(2013).

    [12] Z Zhang, G Zhu, C P Yue. A 0.25–0.4-V, sub-0.11-mW/GHz, 0.15–1.6-GHz PLL using an offset dual-path architecture with dynamic charge pumps. IEEE J Solid State Circuits, 56, 1871(2021).

    [13] L Kong, B Razavi. A 2.4 GHz 4 mW integer-N inductorless RF synthesizer. IEEE J Solid State Circuits, 51, 626(2016).

    [14] Z Zhang, J C Yang, L Y Liu et al. A 0.9–2.25-GHz sub-0.2-mW/GHz compact low-voltage low-power hybrid digital PLL with loop bandwidth-tracking technique. IEEE Trans Very Large Scale Integr VLSI Syst, 26, 933(2018).

    [16] M Karimi, M H Maghami, M Faizollah et al. A noncoherent low-power high-data-rate BPSK demodulator and clock recovery circuit for implantable biomedical devices. 2014 IEEE Biomedical Circuits and Systems Conference (BioCAS) Proceedings, 372(2014).

    Xinyu Shen, Zhao Zhang, Jie Yang, Jian Liu, Nanjian Wu, Mohamad Sawan, Liyuan Liu. A 0.0012-mm2 0.66-pJ/bit BPSK demodulator incorporating a loop-filter-less PLL achieving the maximum data rate of fcarrier/2[J]. Journal of Semiconductors, 2025, 46(3): 032201
    Download Citation