Large Collection of Brain Cancer Data Now Easily, Freely Accessible to Global Researchers

A valuable cache of brain cancer biomedical data has been made freely available to researchers worldwide, say researchers at Georgetown Lombardi Comprehensive Cancer Center. The dataset, REMBRANDT (REpository for Molecular BRAin Neoplasia DaTa), hosted and supported by Georgetown, is one of only two such large collections in the country.

The brain cancer data collection, which contains information on 671 adult patients collected from 14 contributing institutions, is described in Scientific Data, an open-access Nature journal. Already, thousands of researchers in the U.S. and abroad log on to the data site daily, and word about the resource is expected to increase its use, says Subha Madhavan, PhD, chief data scientist at Georgetown University Medical Center and director of the Innovation Center for Biomedical Informatics (ICBI) at Georgetown Lombardi.

The Georgetown data resource is unique in several ways. One is that it contains genomic information, collected from volunteer patients who allowed their tumors to be sampled, as well as diagnostic (including brain scans), treatment and outcomes data. Most collections contain either one or the other.

Additionally, the data collection interface is extraordinarily easy to use, Madhavan says.

"It sits on Amazon Web Services, and has a simple web interface access to data and analysis tools. All a researcher needs is a computer and an internet connection to log onto this interface to select, filter, analyze and visualize the brain tumor datasets."

"We want this data to be widely used by the broadest audience - the entire biomedical research community - so that imagination and discovery is maximized," says Yuriy Gusev, PhD, the paper's first author, an associate professor and a faculty member of the ICBI. "Our common goal is to tease apart the clues hidden within this biomedical and clinical information in order to find ways that advance diagnostic and clinical outcomes for these patients."

"We are just beginning to understand the science of how these cancers evolve and how best to treat them, and datasets like this will likely be very helpful," Madhavan says.

The REMBRANDT dataset was originally created at the National Cancer Institute and funded by the Glioma Molecular Diagnostic Initiative, led by co-authors Howard Fine, MD, from New York Presbyterian Hospital, and Jean-Claude Zenklusen, PhD, from the National Cancer Institute. They collected the data from 2004 to 2006.

The NCI transferred the data to Georgetown in 2015, and it is now hosted, alongside other cancer studies, on the Georgetown Database of Cancer (G-DOC), a cancer data integration and sharing platform. G-DOC investigators, led by Madhavan, developed novel analytical tools to process the information anew.

The genomic data includes the specific genes within individual tumors that are either over-expressed or under-expressed, as well as the number of times each gene is repeated within a chromosome.

"We inherit two copies of a gene - one from Mom and one from Dad - but in cancer cells, DNA segments containing important tumor suppressor genes or oncogenes can be entirely deleted or amplified. It isn't unusual to see a chromosome within a tumor that has 11 copies of a gene, each of which may be producing a toxic protein that helps the cancer grow uncontrollably," Madhavan says.

The data collection also includes information on RNA, which is produced by genes (DNA) and can be measured to identify genes that are dysregulated.

Researchers can search for a gene of interest, check its expression and amplification status, and link that to clinical outcomes, Madhavan says. They can save their findings to their workspace on the G-DOC site and share them with collaborators. Given the approximately 20,000 protein-coding genes in the human genome, and the variety of brain cancer tumor types, "it will take a big village - really a vast metro area - of investigators to understand the bases of these tumors and to effectively develop treatments that target them."
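For readers who want a concrete picture of that kind of lookup, the following is a minimal Python sketch of relating one gene's copy-number status to patient outcomes. It is an illustration only: the records are made-up toy values rather than REMBRANDT data, the field names and function names are hypothetical, and it does not represent the G-DOC interface or its analysis tools.

```python
from statistics import median

# Toy, invented records for a single hypothetical gene; NOT REMBRANDT data.
# "copies" is the number of copies of the gene found in the tumor.
patients = [
    {"id": "P1", "copies": 11, "survival_months": 8},
    {"id": "P2", "copies": 2, "survival_months": 20},
    {"id": "P3", "copies": 6, "survival_months": 12},
    {"id": "P4", "copies": 2, "survival_months": 30},
    {"id": "P5", "copies": 1, "survival_months": 15},
]


def copy_number_status(copies, normal=2):
    """Label a copy count relative to the two inherited copies."""
    if copies < normal:
        return "loss"
    if copies > normal:
        return "amplified"
    return "normal"


def median_survival_by_status(records):
    """Group patients by copy-number status and take the median survival."""
    groups = {}
    for record in records:
        status = copy_number_status(record["copies"])
        groups.setdefault(status, []).append(record["survival_months"])
    return {status: median(values) for status, values in groups.items()}


summary = median_survival_by_status(patients)
for status, months in sorted(summary.items()):
    print(f"{status}: median survival {months} months")
```

A real analysis would use many more patients and a proper survival method; the sketch only shows the grouping step of linking a genomic feature to a clinical outcome.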

REMBRANDT includes genomic data from 261 samples of glioblastoma, 170 of astrocytoma, 86 of oligodendroglioma, and a number that are mixed or of an unknown subclass. Outcomes data include more than 13,000 data points.

Yuriy Gusev, Krithika Bhuvaneshwar, Lei Song, Jean-Claude Zenklusen, Howard Fine, Subha Madhavan.
The REMBRANDT study, a large collection of genomic data from brain cancer patients.
Scientific Data volume 5, Article number: 180158 (2018). doi: 10.1038/sdata.2018.158.
