"There's been a lot of concern about how machine learning will actually work within the medical field," said Allison Park, a Stanford graduate student in statistics and co-lead author of the paper. "This research is an example of how humans stay involved in the diagnostic process, aided by an artificial intelligence tool."
This tool, which is built around an algorithm called HeadXNet, improved clinicians' ability to correctly identify aneurysms at a level equivalent to finding six more aneurysms in 100 scans that contain aneurysms. It also improved consensus among the interpreting clinicians. While the success of HeadXNet in these experiments is promising, the team of researchers - who have expertise in machine learning, radiology and neurosurgery - cautions that further investigation is needed to evaluate generalizability of the AI tool prior to real-time clinical deployment given differences in scanner hardware and imaging protocols across different hospital centers. The researchers plan to address such problems through multi-center collaboration.
Augmented Expertise
Combing brain scans for signs of an aneurysm can mean scrolling through hundreds of images. Aneurysms come in many sizes and shapes and balloon out at tricky angles - some register as no more than a blip within the movie-like succession of images.
"Search for an aneurysm is one of the most labor-intensive and critical tasks radiologists undertake," said Kristen Yeom, associate professor of radiology and co-senior author of the paper. "Given inherent challenges of complex neurovascular anatomy and potential fatal outcome of a missed aneurysm, it prompted me to apply advances in computer science and vision to neuroimaging."
Yeom brought the idea to the AI for Healthcare Bootcamp run by Stanford's Machine Learning Group, which is led by Andrew Ng, adjunct professor of computer science and co-senior author of the paper. The central challenge was creating an artificial intelligence tool that could accurately process these large stacks of 3D images and complement clinical diagnostic practice.
To train their algorithm, Yeom worked with Park and Christopher Chute, a graduate student in computer science, and outlined clinically significant aneurysms detectable on 611 computerized tomography (CT) angiogram head scans.
"We labelled, by hand, every voxel - the 3D equivalent to a pixel - with whether or not it was part of an aneurysm," said Chute, who is also co-lead author of the paper. "Building the training data was a pretty grueling task and there were a lot of data."
Following the training, the algorithm decides for each voxel of a scan whether there is an aneurysm present. The end result of the HeadXNet tool is the algorithm's conclusions overlaid as a semi-transparent highlight on top of the scan. This representation of the algorithm's decision makes it easy for the clinicians to still see what the scans look like without HeadXNet's input.
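As a rough illustration of the overlay idea described above, the sketch below thresholds a per-voxel probability map and alpha-blends a highlight color onto one grayscale slice. The function name `overlay_slice`, the 0.5 threshold, the red highlight and the 0.4 opacity are all illustrative assumptions, not details taken from HeadXNet itself.

```python
def overlay_slice(gray, prob, threshold=0.5, color=(255, 0, 0), alpha=0.4):
    """Blend a highlight onto voxels whose predicted probability meets the threshold.

    gray: 2D list of grayscale intensities in 0..255 (one slice of a scan)
    prob: 2D list of per-voxel aneurysm probabilities in 0..1, same shape
    Returns a 2D list of (r, g, b) tuples.

    All parameter names and defaults are hypothetical, chosen for illustration.
    """
    out = []
    for gray_row, prob_row in zip(gray, prob):
        row = []
        for g, p in zip(gray_row, prob_row):
            if p >= threshold:
                # Semi-transparent highlight: keep (1 - alpha) of the scan intensity
                pixel = tuple(round((1 - alpha) * g + alpha * c) for c in color)
            else:
                # Unflagged voxels are shown exactly as in the original scan
                pixel = (g, g, g)
            row.append(pixel)
        out.append(row)
    return out


# Toy 2x2 slice with one voxel flagged by the (made-up) probability map
gray = [[100, 100], [200, 50]]
prob = [[0.1, 0.9], [0.2, 0.3]]
blended = overlay_slice(gray, prob)
```

Because the highlight is only partly opaque, the underlying scan intensity still contributes to every flagged pixel, which is what lets a reader see the anatomy beneath the algorithm's suggestion.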
"We were interested how these scans with AI-added overlays would improve the performance of clinicians," said Pranav Rajpurkar, a graduate student in computer science and co-lead author of the paper. "Rather than just having the algorithm say that a scan contained an aneurysm, we were able to bring the exact locations of the aneurysms to the clinician's attention."
Eight clinicians tested HeadXNet by evaluating a set of 115 brain scans for aneurysm, once with the help of HeadXNet and once without. With the tool, the clinicians correctly identified more aneurysms, and therefore reduced the "miss" rate, and the clinicians were more likely to agree with one another. HeadXNet did not influence how long it took the clinicians to decide on a diagnosis or their ability to correctly identify scans without aneurysms - a guard against telling someone they have an aneurysm when they don't.
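The "miss" rate and the ability to correctly clear scans without aneurysms correspond to the standard measures of sensitivity and specificity. A minimal sketch of how the two are computed follows; the function name and the eight toy scans are made up for illustration and are not data from the study.

```python
def sensitivity_specificity(truth, calls):
    """Return (sensitivity, specificity) for binary reads of a set of scans.

    truth: 1 if the scan really contains an aneurysm, else 0
    calls: 1 if the reader flagged the scan as containing an aneurysm, else 0
    """
    pairs = list(zip(truth, calls))
    tp = sum(1 for t, c in pairs if t and c)          # aneurysm present and found
    fn = sum(1 for t, c in pairs if t and not c)      # aneurysm present but missed
    tn = sum(1 for t, c in pairs if not t and not c)  # correctly cleared
    fp = sum(1 for t, c in pairs if not t and c)      # false alarm
    return tp / (tp + fn), tn / (tn + fp)


# Toy reads, NOT study data: four scans with an aneurysm, four without
truth   = [1, 1, 1, 1, 0, 0, 0, 0]
unaided = [1, 1, 0, 0, 0, 0, 0, 1]
aided   = [1, 1, 1, 0, 0, 0, 0, 1]

unaided_result = sensitivity_specificity(truth, unaided)
aided_result = sensitivity_specificity(truth, aided)
```

In this toy case the aided reads miss fewer aneurysms (higher sensitivity, so a lower miss rate) while specificity stays the same, which mirrors the qualitative pattern the study reports.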
To Other Tasks and Institutions
The machine learning methods at the heart of HeadXNet could likely be trained to identify other diseases inside and outside the brain. For example, Yeom imagines a future version could focus on speeding up identifying aneurysms after they have burst, saving precious time in an urgent situation. But a considerable hurdle remains in integrating any artificial intelligence medical tools with daily clinical workflow in radiology across hospitals.
Current scan viewers aren't designed to work with deep learning assistance, so the researchers had to custom-build tools to integrate HeadXNet within scan viewers. Similarly, variations in real-world data - as opposed to the data on which the algorithm is tested and trained - could reduce model performance. If the algorithm processes data from different kinds of scanners or imaging protocols, or a patient population that wasn't part of its original training, it might not work as expected.
"Because of these issues, I think deployment will come faster not with pure AI automation, but instead with AI and radiologists collaborating," said Ng. "We still have technical and non-technical work to do, but we as a community will get there and AI-radiologist collaboration is the most promising path."
Allison Park, Chris Chute, Pranav Rajpurkar, Joe Lou, Robyn L Ball, Katie Shpanskaya, Rashad Jabarkhee, Lily H Kim, Emily McKenna, Joe Tseng, Jason Ni, Fidaa Wishah, Fred Wittber, David S Hong, Thomas J Wilson, Safwan Halabi, Sanjay Basu, Bhavik N Patel, Matthew P Lungren, Andrew Y Ng, Kristen W Yeom.
Deep Learning-Assisted Diagnosis of Cerebral Aneurysms Using the HeadXNet Model.
JAMA Netw Open. 2019;2(6):e195600. doi: 10.1001/jamanetworkopen.2019.5600.