{"id":12,"date":"2016-07-09T06:54:28","date_gmt":"2016-07-09T06:54:28","guid":{"rendered":"http:\/\/147.47.120.188\/?page_id=12"},"modified":"2026-04-16T13:39:52","modified_gmt":"2026-04-16T13:39:52","slug":"research","status":"publish","type":"page","link":"https:\/\/mms.snu.ac.kr\/?page_id=12","title":{"rendered":"Research"},"content":{"rendered":"<h1>Current research projects<\/h1>\n<h3>Energy-efficient\u00a0machine learning processors<\/h3>\n<p style=\"text-align: justify;\">We are designing highly energy-efficient machine learning processors by developing both hardware-friendly algorithms and low-power hardware design techniques. Our ultimate goal is to build practical machine-learning processors that exhibit both state-of-the-art algorithm performance and energy efficiency by finding the best algorithm-hardware combination. For instance, we developed a neuromorphic processor implementing an SNN (Spiking Neural Network), which achieved substantially better energy efficiency while closely matching conventional deep learning in accuracy.<\/p>\n<h3>Hardware-oriented deep learning algorithms<\/h3>\n<p style=\"text-align: justify;\">Conventional deep learning algorithms indeed demonstrate excellent performance across many tasks, but often they are not suitable for hardware implementation due to excessive amount of computation and memory requirements. We aim to optimize state-of-the-art deep learning algorithms to make them suitable for hardware implementation by significantly lowering computational overheads. We recently demonstrated a reliable low-precision training algorithm with minimal accuracy degradation, allowing for the use of lightweight arithmetic units and lowering memory access overheads.<\/p>\n<h3>Circuits for machine learning<\/h3>\n<p style=\"text-align: justify;\">Digital CMOS circuits offer very reliable operation and are easy to design with, but they have certain limits on the achievable performance and efficiency. We are trying to overcome this limitation by adopting various circuit techniques in machine learning systems. One example is in-memory computing circuits, which improve energy efficiency through analog or mixed-signal operation on various deep learning models.<\/p>\n<h3>Power management circuits<\/h3>\n<p style=\"text-align: justify;\">Power supply regulation is one of the key challenges in modern large-scale digital SoCs. We look into both on-chip and off-chip approaches to reliably regulate power supply while mitigating common design issues. In addition, we aim to adopt system-level analysis to minimize design overhead.<\/p>\n<p><!--\n\n\n<h3>Analog front-end\u00a0circuit for SiPM sensors<\/h3>\n\n\n\n\n<p style=\"text-align: justify;\">Typical\u00a0radiation detecting sensors require a dedicated highly sensitive\u00a0analog front-end. By implementing the front-end as an integrated chip (IC), we look forward to achieving high resolution performance, as well as low power consumption and compact die area. Furthermore, we aim to develop novel circuit techniques to compensate degrading effects when sensing signals from\u00a0a large SiPM array. Application-specific\u00a0design process\u00a0will further\u00a0optimize the whole system. This design is expected to be practically applied to current medical imaging applications, such as gamma camera and PET.<\/p>\n\n\n\n&nbsp;\n--><\/p>\n<h1>Recent research examples<\/h1>\n<h3>8-bit Floating-Point Deep Neural Network Training Processor<\/h3>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-2.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-328 size-medium\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-2-300x175.jpg\" alt=\"isscc2021-2\" width=\"300\" height=\"175\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-2-300x175.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-2.jpg 566w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-327\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-1-1024x382.jpg\" alt=\"isscc2021-1\" width=\"469\" height=\"175\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-1-1024x382.jpg 1024w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-1-300x112.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-1-768x286.jpg 768w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-1-624x233.jpg 624w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/isscc2021-1.jpg 1150w\" sizes=\"auto, (max-width: 469px) 100vw, 469px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Due to the recent shift in the machine learning community towards using non-sparse activation functions such as Leaky ReLU or Swish for better training convergence, state-of-the-art models no longer exhibit the sparsity found in conventional ReLU-based models. Moreover, more difficult tasks such as image super-resolution require higher precision than plain 8-bit integers not just for training, but for inference without large accuracy degradation.\u00a0To overcome such challenges,\u00a0we developed a deep learning processor with support for efficient inference and training for various modern neural networks. Our key contributions are: (1) an 8-bit floating point data format with shared exponent bias (FP8-SEB) for robust training in low precision, (2) a processing architecture employing 24-way Fused Multiply-Add (FMA) trees and high-precision accumulators that improves training accuracy and energy efficiency, and (3) 2-D routing scheme on input\/output points of processing element array for flexible training of various models with minimal hardware overhead. Based on these contributions, our neural network training processor achieves 4.81TFLOPS\/W energy efficiency, 567GFLOPS performance, and robust training on various models including Generative Adversarial Network (GAN), Long-Short-Term-Memory (LSTM), transformer models, as well as Convolutional Neural Network (CNN).<\/p>\n<p>Publications: <a href=\"https:\/\/ieeexplore.ieee.org\/document\/9366031\">ISSCC 2021<\/a>, <a href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/9515082\">JSSC 2022<\/a><br \/>\nAwards:<br \/>\n&#8211; Gold Award (1st place in circuit design), Samsung Humantech, 2021.<br \/>\n&#8211; Best Design Award, ACM\/IEEE International Symposium on Low Power Electronics and Design, 2021.<\/p>\n<h3><\/h3>\n<h3>Energy-efficient neuromorphic processor with on-chip training<\/h3>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/neuromorphic1.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-272\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/neuromorphic1.gif\" alt=\"neuromorphic1\" width=\"240\" height=\"179\" \/><\/a> <a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/neuromorphic2.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-273\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/neuromorphic2.gif\" alt=\"neuromorphic2\" width=\"127\" height=\"170\" \/><\/a> <a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/neuromorphic3.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-274\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/neuromorphic3-300x209.gif\" alt=\"neuromorphic3\" width=\"240\" height=\"167\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Neuromorphic computing algorithms such as SNN (Spiking Neural Network) offer great energy efficiency, but they are not considered\u00a0practical solutions yet due to low algorithm accuracy. In this work, we proposed a new neuromorphic computing algorithm based on\u00a0bio-plausible approach, which provides similar performance to SGD-based learning scheme.\u00a0By combining\u00a0a well-optimized architecture to further improve computation efficiency,\u00a0we designed a ultra-low power\u00a0on-chip trainable neuromorphic processor for classification tasks. Training energy of 254.3 nJ\/image is achieved through a hardware-oriented direct feedback algorithm, out-of-order weight updates, and an update-skipping mechanism. During prediction, the processor achieves 97.83% MNIST recognition accuracy with 236.5 nJ\/prediction. This result demonstrates a 7.5% training energy overhead, while providing superior energy efficiency than recent inference-only designs.<\/p>\n<p>Publications: <a href=\"https:\/\/ieeexplore.ieee.org\/document\/8662398\">ISSCC 2019<\/a>, <a href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/8867974\">JSSC 2020<\/a><br \/>\nMedia coverage: <a href=\"https:\/\/news.naver.com\/main\/read.nhn?mode=LSD&amp;mid=sec&amp;sid1=105&amp;oid=020&amp;aid=0003217929\">Dong-A Ilbo (Korean Newspaper)<\/a><\/p>\n<h3><\/h3>\n<h3>Low-dropout regulator\u00a0with high efficiency and wide operating range<\/h3>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-319\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO1-300x225.jpg\" alt=\"LDO1\" width=\"247\" height=\"185\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO1-300x225.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO1-768x575.jpg 768w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO1-624x467.jpg 624w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO1.jpg 887w\" sizes=\"auto, (max-width: 247px) 100vw, 247px\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO3.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-320\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO3-300x199.jpg\" alt=\"LDO3\" width=\"276\" height=\"183\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO3-300x199.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO3.jpg 548w\" sizes=\"auto, (max-width: 276px) 100vw, 276px\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO4.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-321\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO4-300x192.jpg\" alt=\"LDO4\" width=\"267\" height=\"171\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO4-300x192.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/LDO4.jpg 437w\" sizes=\"auto, (max-width: 267px) 100vw, 267px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Large-scale digital SoCs must be accompanied by a reliable, efficient power management system. Low-dropout regulator (LDO) is one of the key blocks in those systems, where\u00a0it must generate a constant output voltage\u00a0under a wide range of input voltage and load current. We\u00a0proposed a new LDO topology\u00a0using a voltage-difference-to-time converter and direct output feedback path, and the design combines the advantages of conventional digital and analog LDOs.\u00a0 The\u00a0fabricated LDO achieves\u00a00.202fs FOM, which is significantly lower than prior works.<\/p>\n<p style=\"text-align: justify;\">Publication:\u00a0<a href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/9134432\">TCAS-II 2021<\/a><\/p>\n<h3><\/h3>\n<h3>Radiation hardening by design<\/h3>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd4.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-242\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd4-300x273.jpg\" alt=\"rhbd4\" width=\"209\" height=\"191\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd4-300x273.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd4.jpg 416w\" sizes=\"auto, (max-width: 209px) 100vw, 209px\" \/><\/a>\u00a0<a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd3.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-241\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd3-300x225.jpg\" alt=\"rhbd3\" width=\"255\" height=\"191\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd3-300x225.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd3-624x468.jpg 624w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd3.jpg 742w\" sizes=\"auto, (max-width: 255px) 100vw, 255px\" \/><\/a>\u00a0<a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd2.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-240\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd2-165x300.jpg\" alt=\"rhbd2\" width=\"104\" height=\"189\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd2-165x300.jpg 165w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd2-768x1395.jpg 768w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd2-564x1024.jpg 564w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd2-624x1133.jpg 624w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/rhbd2.jpg 1960w\" sizes=\"auto, (max-width: 104px) 100vw, 104px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Under radiation emitting circumstances, high energy particles induce accumulation of holes in the oxide. The accumulated holes modify transistor characteristics such as threshold voltage,\u00a0degrading hardware reliability.\u00a0In order to\u00a0avoid transistor malfunction\u00a0in a circuit, radiation hardening by design (RHBD) techniques\u00a0have been\u00a0actively studied. In this project, we applied several topologies to\u00a0an inverter which is\u00a0a\u00a0basic building block of all digital systems, and\u00a0experimentally analyze the\u00a0reliability of each\u00a0circuit topology under high radiation dose. Measured results confirm that the stacked NMOS topology performs best, reducing switching point variation by &gt;10x and average power by 20%\u00a0at the expense of\u00a030% area overhead.<\/p>\n<p>Publication:\u00a0<a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0168900218314670\">NIMA 2020<\/a><\/p>\n<p><em>Please note that the completed projects shown below were conducted at the University of Michigan (Advisor: Prof. Dennis Sylvester) and MIT (Advisor: Prof. Anantha Chandrakasan).<\/em><\/p>\n<h3>Navigation system for the\u00a0visually impaired<\/h3>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/die-300x240-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-30\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/die-300x240-1-300x240.png\" alt=\"die-300x240\" width=\"230\" height=\"184\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/system1-300x228-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-88\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/system1-300x228-1-300x228.png\" alt=\"system1-300x228\" width=\"235\" height=\"179\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/system2-266x300-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-91\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/system2-266x300-1-266x300.png\" alt=\"system2-266x300\" width=\"187\" height=\"211\" \/><\/a><\/p>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/DSC_0121-624x413.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-34\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/DSC_0121-624x413-300x199.jpg\" alt=\"DSC_0121-624x413\" width=\"300\" height=\"199\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/DSC_0121-624x413-300x199.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/DSC_0121-624x413.jpg 624w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/IMG_3260-1024x768-1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-75\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/IMG_3260-1024x768-1-300x225.jpg\" alt=\"IMG_3260-1024x768\" width=\"263\" height=\"197\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/IMG_3260-1024x768-1-300x225.jpg 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/IMG_3260-1024x768-1-768x576.jpg 768w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/IMG_3260-1024x768-1.jpg 1024w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/IMG_3260-1024x768-1-624x468.jpg 624w\" sizes=\"auto, (max-width: 263px) 100vw, 263px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">In this project, we developed a fully-integrated navigation system for the visually impaired. Since processing 3-D imaging data using exsiting algorithms is computationally expensive, commercial off-the-shelf processors are not able to process 3-D data in real time. To tackle this issue, we designed a highly energy-efficient vision processor tailored for general 3-D image data processing algorithms. By designing a new hardware architecture as well as optimizing conventional algorithms further, the processor fabricated in 40nm CMOS technology consumes only 8mW while processing 30fps 3-D input video stream in real time. We also implemented a fully-integrated navigation system and confirmed its functionality in various experiments.<\/p>\n<p>Publication: <a href=\"http:\/\/ieeexplore.ieee.org\/xpl\/articleDetails.jsp?arnumber=7418084\"><span style=\"color: #0066cc;\">ISSCC 2016<\/span><\/a><br \/>\nSelected media coverage: <a href=\"https:\/\/www.eetimes.com\/19-views-of-isscc\/\"><span style=\"color: #0066cc;\">EE Times,<\/span><\/a> <a href=\"http:\/\/www.techtimes.com\/articles\/131279\/20160207\/wearable-device-with-3d-camera-may-guide-the-visually-impaired.htm\"><span style=\"color: #0066cc;\">Tech Times<\/span><\/a>, <a href=\"http:\/\/www.qmed.com\/mpmn\/medtechpulse\/could-virtual-guide-dog-help-visually-impaired\"><span style=\"color: #0066cc;\">Qmed<\/span><\/a>, <a href=\"http:\/\/www.dailydot.com\/technology\/mit-virtual-guide-dog-braille\/\"><span style=\"color: #0066cc;\">The Daily Dot<\/span><\/a><\/p>\n<h3>Face recognition system for mobile platforms<\/h3>\n<p style=\"text-align: center;\">\u00a0<a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/face.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-46\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/face.gif\" alt=\"face\" width=\"513\" height=\"216\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/face1.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-47\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/face1.gif\" alt=\"face1\" width=\"199\" height=\"215\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Face detection and recognition\u00a0are now available on many mobile platforms. For instance, facebook app can reliably detect faces from uploaded photos. However, we wanted to take one step further and design a complete standalone\u00a0face recognition system that has both face detection and recognition features and processes input\u00a0video in real time. Based on classical but very reliable algorithms including cascade classifiers and Support Vector Machine (SVM), we optimized hardware architecture to minimize processing overheads. In addition, we proposed a new SRAM design tailored for this specific application to tackle the issue of large leakage current in advaned CMOS processes. The\u00a0fabricated design\u00a0consumes 23mW while processing HD video with 5.5fps throughput.<\/p>\n<p>Publication: <a href=\"http:\/\/ieeexplore.ieee.org\/xpls\/abs_all.jsp?arnumber=7231322\"><span style=\"color: #0066cc;\">Symposium on VLSI Circuits 2015<\/span><\/a><\/p>\n<h3>Syringe-implantable ECG monitoring device<\/h3>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-36\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg1.png\" alt=\"ecg1\" width=\"258\" height=\"190\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-39\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg2.png\" alt=\"ecg2\" width=\"267\" height=\"189\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg2.png 323w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg2-300x212.png 300w\" sizes=\"auto, (max-width: 267px) 100vw, 267px\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg3.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-42\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/ecg3.gif\" alt=\"ecg3\" width=\"232\" height=\"186\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">With continuous semiconductor technology scaling, recently small form factor implantable biomedical systems gained lots of attentions due to its\u00a0enhanced computation capacity. In this project, we implemented\u00a0a syringe-implantable ECG monitoring system primarily for arrhythmia detection. The system has a powerful ARM Cortex-M0 paired with custom digital signal processing blocks, which delivers both\u00a0enough computing power and flexibility. The proposed system only consumes 65nW while monitoring ECG waveforms continuously and we confirmed its functionality in the experiment using a live sheep heart.<\/p>\n<p>Publications: <a href=\"http:\/\/ieeexplore.ieee.org\/xpl\/articleDetails.jsp?arnumber=6757494\"><span style=\"color: #0066cc;\">ISSCC 2014<\/span><\/a>, <a href=\"http:\/\/ieeexplore.ieee.org\/xpl\/articleDetails.jsp?arnumber=6957602\"><span style=\"color: #0066cc;\">JSSC 2015<\/span><\/a><\/p>\n<h3><\/h3>\n<h3>Feature extraction accelerator<\/h3>\n<p style=\"text-align: center;\"><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fe1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-54\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fe1.png\" alt=\"fe1\" width=\"468\" height=\"211\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fe1.png 584w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fe1-300x135.png 300w\" sizes=\"auto, (max-width: 468px) 100vw, 468px\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fe3-182x300-1.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-62\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fe3-182x300-1.gif\" alt=\"fe3-182x300\" width=\"129\" height=\"213\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Visual feature extraction is a\u00a0method to extract useful information from input image or video, and this information can be later used in other algorithms including object recognition and pose estimation. Since it is a key part in computer vision\u00a0and generally requires a large amount of computation, we designed a specialized\u00a0accelerator to significantly reduce processing overhead. Starting from original Speeded-Up Robust Features (SURF)\u00a0algorithm we modified it from hardware design perspective, which resulted in more hardware-friendly algorithm. We also proposed a low-power robust FIFO design to mitigate tremendous amount of leakage power in low operating voltages. The proposed design provides 3.5x efficiency improvements than state-of-the-art designs.<\/p>\n<p>Publications: <a href=\"http:\/\/ieeexplore.ieee.org\/xpl\/articleDetails.jsp?arnumber=6487684\"><span style=\"color: #000080;\">ISSCC 2013<\/span><\/a>, <a href=\"http:\/\/ieeexplore.ieee.org\/xpl\/articleDetails.jsp?arnumber=6762964\"><span style=\"color: #000080;\">JSSC 2014<\/span><\/a><br \/>\nMedia coverage: <a href=\"http:\/\/spectrum.ieee.org\/robotics\/robotics-hardware\/better-eyes-for-flying-robots\"><span style=\"color: #0066cc;\">IEEE Spectrum<\/span><\/a><\/p>\n<h3>Energy-efficient FFT processor<\/h3>\n<p style=\"text-align: center;\">\u00a0 <a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-63\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft1.png\" alt=\"fft1\" width=\"338\" height=\"286\" srcset=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft1.png 1096w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft1-300x253.png 300w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft1-768x648.png 768w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft1-1024x864.png 1024w, https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft1-624x527.png 624w\" sizes=\"auto, (max-width: 338px) 100vw, 338px\" \/><\/a><a href=\"http:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft2.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-68\" src=\"https:\/\/mms.snu.ac.kr\/wp-content\/uploads\/2016\/07\/fft2.gif\" alt=\"fft2\" width=\"238\" height=\"276\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Fast Fourier Transform (FFT) is one of the most famous digital signal processing algorithms used in many platforms. In this project, we targeted achieving maximum energy efficiency for the given algorithm in order to increase the lifetime of\u00a0battery-powered systems. We first developed a low leakage, energy-efficient parallel FFT architecture. This architecture was applied to the final FFT processor along with a novel circuit technique called &#8220;super-pipelining&#8221;, resulting in 2x better energy efficiency than previous works.<\/p>\n<p>Publications: <a href=\"http:\/\/ieeexplore.ieee.org\/xpl\/articleDetails.jsp?arnumber=5746346\"><span style=\"color: #000080;\">ISSCC 2011<\/span><\/a>, <a href=\"http:\/\/ieeexplore.ieee.org\/xpl\/articleDetails.jsp?arnumber=6069820\"><span style=\"color: #000080;\">JSSC 2012<\/span><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Current research projects Energy-efficient\u00a0machine learning processors We are designing highly energy-efficient machine learning processors by developing both hardware-friendly algorithms and low-power hardware design techniques. Our ultimate goal is to build practical machine-learning processors that exhibit both state-of-the-art algorithm performance and energy efficiency by finding the best algorithm-hardware combination. For instance, we developed a neuromorphic processor [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-12","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=\/wp\/v2\/pages\/12","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=12"}],"version-history":[{"count":46,"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=\/wp\/v2\/pages\/12\/revisions"}],"predecessor-version":[{"id":471,"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=\/wp\/v2\/pages\/12\/revisions\/471"}],"wp:attachment":[{"href":"https:\/\/mms.snu.ac.kr\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=12"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}