AI model can help determine where a patient’s cancer arose

AI model can help determine where a patient's cancer arose  MIT News

AI model can help determine where a patient’s cancer arose

AI model can help determine where a patient’s cancer arose

Mitigating Challenges in Cancer Treatment: Predicting the Origins of Unknown Cancers

Introduction

For a small percentage of cancer patients, doctors are unable to determine where their cancer originated. This makes it much more difficult to choose a treatment for those patients, because many cancer drugs are typically developed for specific cancer types.

The Role of Machine Learning in Identifying Cancer Origins

A new approach developed by researchers at MIT and Dana-Farber Cancer Institute may make it easier to identify the sites of origin for those enigmatic cancers. Using machine learning, the researchers created a computational model that can analyze the sequence of about 400 genes and use that information to predict where a given tumor originated in the body.

Using this model, the researchers showed that they could accurately classify at least 40 percent of tumors of unknown origin with high confidence, in a dataset of about 900 patients. This approach enabled a 2.2-fold increase in the number of patients who could have been eligible for a genomically guided, targeted treatment, based on where their cancer originated.

Implications for Personalized Treatment

“That was the most important finding in our paper, that this model could be potentially used to aid treatment decisions, guiding doctors toward personalized treatments for patients with cancers of unknown primary origin,” says Intae Moon, an MIT graduate student in electrical engineering and computer science who is the lead author of the new study.

Alexander Gusev, an associate professor of medicine at Harvard Medical School and Dana-Farber Cancer Institute, is the senior author of the paper, which appears today in Nature Medicine.

Addressing Cancers of Unknown Primary (CUP)

In 3 to 5 percent of cancer patients, particularly in cases where tumors have metastasized throughout the body, oncologists don’t have an easy way to determine where the cancer originated. These tumors are classified as cancers of unknown primary (CUP).

This lack of knowledge often prevents doctors from being able to give patients “precision” drugs, which are typically approved for specific cancer types where they are known to work. These targeted treatments tend to be more effective and have fewer side effects than treatments that are used for a broad spectrum of cancers, which are commonly prescribed to CUP patients.

Research Methodology and Results

Moon, an affiliate of the Computer Science and Artificial Intelligence Laboratory who is co-advised by Gusev, decided to analyze genetic data that is routinely collected at Dana-Farber to see if it could be used to predict cancer type. The data consist of genetic sequences for about 400 genes that are often mutated in cancer. The researchers trained a machine-learning model on data from nearly 30,000 patients who had been diagnosed with one of 22 known cancer types. That set of data included patients from Memorial Sloan Kettering Cancer Center and Vanderbilt-Ingram Cancer Center, as well as Dana-Farber.

The researchers then tested the resulting model on about 7,000 tumors that it hadn’t seen before, but whose site of origin was known. The model, which the researchers named OncoNPC, was able to predict their origins with about 80 percent accuracy. For tumors with high-confidence predictions, which constituted about 65 percent of the total, its accuracy rose to roughly 95 percent.

After those encouraging results, the researchers used the model to analyze a set of about 900 tumors from patients with CUP, which were all from Dana-Farber. They found that for 40 percent of these tumors, the model was able to make high-confidence predictions.

Validating the Model’s Predictions

The researchers then compared the model’s predictions with an analysis of the germline, or inherited, mutations in a subset of tumors with available data, which can reveal whether the patients have a genetic predisposition to develop a particular type of cancer. The researchers found that the model’s predictions were much more likely to match the type of cancer most strongly predicted by the germline mutations than any other type of cancer.

To further validate the model’s predictions, the researchers compared data on the CUP patients’ survival time with the typical prognosis for the type of cancer that the model predicted. They found that CUP patients who were predicted to have cancer with a poor prognosis, such as pancreatic cancer, showed correspondingly shorter survival times. Meanwhile, CUP patients who were predicted to have cancers that typically have better prognoses, such as neuroendocrine tumors, had longer survival times.

Enhancing Treatment Decisions

Another indication that the model’s predictions could be useful came from looking at the types of treatments that CUP patients analyzed in the study had received. About 10 percent of these patients had received a targeted treatment, based on their oncologists’ best guess about where their cancer had originated. Among those patients, those who received a treatment consistent with the type of cancer that the model predicted for them fared better than patients who received a treatment typically given for a different type of cancer than what the model predicted for them.

Using this model, the researchers also identified an additional 15 percent of patients (2.2-fold increase) who could have received an existing targeted treatment, if their cancer type had been known. Instead, those patients ended up receiving more general chemotherapy drugs.

“That potentially makes these findings more clinically actionable because we’re not requiring a new drug to be approved. What we’re saying is that this population can now be eligible for precision treatments that already exist,” Gusev says.

Future Directions

The researchers now hope to expand their model to include other types of data, such as pathology images and radiology images, to provide a more comprehensive prediction using multiple data modalities. This would also provide the model with a comprehensive perspective of tumors, enabling it to predict not just the type of tumor and patient outcome, but potentially even the optimal treatment.

Conclusion

The research was funded by the National Institutes of Health, the Louis B. Mayer Foundation, the Doris Duke Charitable Foundation, the Phi Beta Psi Sorority, and the Emerson Collective.

SDGs, Targets, and Indicators

  1. SDG 3: Good Health and Well-being

    • Target 3.4: By 2030, reduce by one-third premature mortality from non-communicable diseases through prevention and treatment and promote mental health and well-being.
    • Indicator 3.4.1: Mortality rate attributed to cardiovascular disease, cancer, diabetes, or chronic respiratory disease.
  2. SDG 9: Industry, Innovation, and Infrastructure

    • Target 9.5: Enhance scientific research, upgrade the technological capabilities of industrial sectors in all countries, in particular developing countries, including, by 2030, encouraging innovation and increasing the number of research and development workers per 1 million people.
    • Indicator 9.5.1: Research and development expenditure as a proportion of GDP.

The article addresses the issue of identifying the sites of origin for cancers of unknown primary (CUP) through the use of machine learning and genetic data analysis. Based on this information, the following SDGs, targets, and indicators can be identified:

1. Which SDGs are addressed or connected to the issues highlighted in the article?

  • SDG 3: Good Health and Well-being
  • SDG 9: Industry, Innovation, and Infrastructure

The article discusses the development of a computational model using machine learning to predict the origin of tumors in cancer patients. This technology contributes to improving health outcomes and precision treatments for patients with cancers of unknown primary origin, aligning with SDG 3. Additionally, the use of machine learning and genetic data analysis represents innovation and technological advancements in the field of cancer research, connecting to SDG 9.

2. What specific targets under those SDGs can be identified based on the article’s content?

  • Target 3.4: By 2030, reduce by one-third premature mortality from non-communicable diseases through prevention and treatment and promote mental health and well-being.
  • Target 9.5: Enhance scientific research, upgrade the technological capabilities of industrial sectors in all countries, in particular developing countries, including, by 2030, encouraging innovation and increasing the number of research and development workers per 1 million people.

The article’s content relates to Target 3.4 as it focuses on improving treatment decisions and guiding doctors towards personalized treatments for patients with cancers of unknown primary origin. By accurately predicting the origin of tumors, precision treatments can be administered, potentially reducing premature mortality rates associated with non-communicable diseases. Furthermore, the development of the computational model using machine learning and genetic data analysis contributes to Target 9.5 by enhancing scientific research and technological capabilities in the field of cancer research.

3. Are there any indicators mentioned or implied in the article that can be used to measure progress towards the identified targets?

  • Indicator 3.4.1: Mortality rate attributed to cardiovascular disease, cancer, diabetes, or chronic respiratory disease.
  • Indicator 9.5.1: Research and development expenditure as a proportion of GDP.

The article does not explicitly mention these indicators, but they can be used to measure progress towards the identified targets. Indicator 3.4.1 can be used to track the reduction in premature mortality rates associated with non-communicable diseases, including cancer. Indicator 9.5.1 can be used to monitor the allocation of resources and investment in research and development in the field of cancer research and technological advancements.

4. SDGs, Targets, and Indicators Table

SDGs Targets Indicators
SDG 3: Good Health and Well-being Target 3.4: By 2030, reduce by one-third premature mortality from non-communicable diseases through prevention and treatment and promote mental health and well-being. Indicator 3.4.1: Mortality rate attributed to cardiovascular disease, cancer, diabetes, or chronic respiratory disease.
SDG 9: Industry, Innovation, and Infrastructure Target 9.5: Enhance scientific research, upgrade the technological capabilities of industrial sectors in all countries, in particular developing countries, including, by 2030, encouraging innovation and increasing the number of research and development workers per 1 million people. Indicator 9.5.1: Research and development expenditure as a proportion of GDP.

Behold! This splendid article springs forth from the wellspring of knowledge, shaped by a wondrous proprietary AI technology that delved into a vast ocean of data, illuminating the path towards the Sustainable Development Goals. Remember that all rights are reserved by SDG Investors LLC, empowering us to champion progress together.

Source: news.mit.edu

 

Join us, as fellow seekers of change, on a transformative journey at https://sdgtalks.ai/welcome, where you can become a member and actively contribute to shaping a brighter future.