Hengtao Guo

Google Scholar / LinkedIn / Research Gate / Github

I am a Machine Learning Software Engineer at Google, where I focus on building recommendation systems in YouTube main app. Before joining Google, I have worked as a Research Intern at UII America.

I received my Ph.D. in Biomedical Engineering from Rensselaer Polytechnic Institute in 2022, and B.S. in Computer Science from Wuhan University in 2018.

Research Fields

My research focuses span across machine learning, deep learning, and generative artificial intelligence:

Computer Vision: Medical image analysis, image registration/segmentation, ultrasound reconstruction
Recommendation Systems: User modeling, video/audio representation learning
Natural Language Processing: Large language models, retrival-augmented generation

Publications

	SMPL-A: Modeling Person-Specific Deformable Anatomy Hengtao Guo, Benjamin Planche, Meng Zheng, Srikrishna Karanam, Terrence Chen, Ziyan Wu IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 We propose a learning-based method that uses medical scans to predict internal organ deformation across various human poses, aiding radiotherapy and similar treatments. This approach builds a patient-specific model encoding the organ's shape and elasticity, allowing for deformation estimation based on the patient's current pose. This innovation offers clinicians precise guidance without additional scans or procedures. poster
	Sensorless Freehand 3D Ultrasound Reconstruction via Deep Contextual Learning Hengtao Guo, Sheng Xu, Bradford J. Wood, Pingkun Yan Medical Image Computing and Computer Assisted Intervention (MICCAI), 2020 We propose a deep contextual learning network (DCL-Net), which can efficiently exploit the image feature relationship between US frames and reconstruct 3D US volumes without any tracking device. The proposed DCL-Net utilizes 3D convolutions over a US video segment for feature extraction. An embedded self-attention module makes the network focus on the speckle-rich areas for better spatial movement prediction, with a novel case-wise correlation loss to stabilize the training process for improved accuracy. arxiv / code
	Ultrasound Volume Reconstruction From Freehand Scans Without Tracking Hengtao Guo, Hanqing Chao, Sheng Xu, Bradford J. Wood, Jing Wang, Pingkun Yan IEEE Transactions on Biomedical Engineering, 2022 We propose a deep contextual-contrastive network, utilizing self-attention to focus on the speckle-rich areas to estimate spatial movement and then minimizes a margin ranking loss for contrastive feature learning. We train and validate the model on two independent datasets, evaluated with including the DICE of reconstructed organ segmentation.
	Knowledge-based Analysis for Mortality Prediction from CT Images Hengtao Guo, Uwe Kruger, Ge Wang, Mannudeep K. Kalra, Pingkun Yan IEEE Journal of Biomedical and Health Informatics, 2019 This paper introduces a knowledge-based analytical method using deep convolutional neural network (CNN) for all-cause mortality prediction. The underlying approach combines structural image features extracted from CNNs, based on LDCT volume at different scales, and clinical knowledge obtained from quantitative measurements, to predict the mortality risk of lung cancer screening subjects.
	Ultrasound Frame-to-Volume Registration via Deep Learning for Interventional Guidance Hengtao Guo, Xuanang Xu, Xinrui Song, Sheng Xu, Hanqing Chao, Joshua Myers, Baris Turkbey, Peter A. Pinto, Bradford J. Wood, Pingkun Yan IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control, 2023 We propose a novel US frame-to-volume registration (FVReg) pipeline to bridge the dimensionality gap between 2-D US frames and 3-D US volume. The developed pipeline is implemented using deep neural networks, which are fully automatic without requiring external tracking devices. We validated our method on a clinical dataset with 618 subjects and tested its potential on real-time 2-D-US to 3-D-MR fusion navigation tasks. The proposed FVReg achieved an average target navigation error of 1.93 mm at 5 to 14 fps. code
	Deep learning predicts cardiovascular disease risks from lung cancer screening low dose computed tomography Hanqing Chao, Hongming Shan, Fatemeh Homayounieh, Ramandeep Singh, Ruhani Doda Khera, Hengtao Guo, Timothy Su, Ge Wang, Mannudeep K. Kalra, Pingkun Yan Nature Communications, 2021 Cancer patients have a higher risk of cardiovascular disease (CVD) mortality than the general population. Low dose computed tomography (LDCT) for lung cancer screening offers an opportunity for simultaneous CVD risk estimation in at-risk patients. Our deep learning CVD risk prediction model, trained with 30,286 LDCTs from the National Lung Cancer Screening Trial, achieves an area under the curve (AUC) of 0.871 on a separate test set of 2,085 subjects and identifies patients with high CVD mortality risks (AUC of 0.768).

Review Service

Computer Vision and Pattern Recognition (CVPR)
European Conference on Computer Vision (ECCV)
Medical Image Computing and Computer Assisted Intervention (MICCAI)
Machine Learning in Medical Imaging
Computerized Medical Imaging and Graphics
Artificial Intelligence with Biased or Scarce Data
Frontiers in Cardiovascular Medicine
Neurocomputing