MSK Archive Receives Digitization Grant

The Archives at Memorial Sloan Kettering was awarded a METRO Digitization grant to reformat and make public MSK’s Annual Reports from 1884 to present.

The Metropolitan New York Library Council is a group that supports libraries, museums, archives, and community organizations within New York City. METRO’s annual digitization project grant program provides funding to make vital content from cultural heritage institutions in New York available. By securing this grant, MSK joins the ranks of other medical archives in New York, including Mount Sinai Health System and Weill Cornell Medicine, in bringing parts of their collections online.

Groundbreaking of Memorial Hospital

The Archives collects materials documenting the history, accomplishments, and daily life at MSK. This includes photographs, Board minutes, research notes, and event materials, among other materials. The images in this post offer just a glimpse of the richness of these collections. Although the Archives has existed in some form since 1980, it has expanded in recent years due to the hiring of its first full-time Archivist in 2022. Through this grant and other work, the Archives hopes to increase access
to certain materials for researchers in the history of medicine, genealogists, and others who are interested in the cultural value of MSK’s history.

The Annual Reports provide information on annual statistics, major discoveries, donations, and stories
of people touched by cancer. In addition to documenting medical history, these reports also document the growth of MSK across New York City and the metro areas, the development of new areas of practice, and our changing relationship to cancer and mortality.

The collection is expected to be available in early 2026.

MSK Radiation Therapy Service, circa 1949

Join us for “Adventures in Text Mining: Applications, Ethics, and Cancer Care”

Promotional banner for Adventures in Text Mining eventJoin us for our webinar “Adventures in Text Mining: Applications, Ethics, and Cancer Care” on October 16 from 12:00 PM-1:00 PM Eastern Time.

What is Text Mining?
Text mining helps researchers sift through mountains of documents, clinical notes, and research papers to find important patterns and information quickly. Dr. Manika Lamba (Assistant Professor, School of Library and Information Studies, University of Oklahoma) will introduce the topic through the lens of her work in digital libraries and information organization.

Applications in Cancer Care
Dr. Anyi Li (Chief, Associate Attendings, Department of Medical Physics, Memorial Sloan Kettering) will explain how applying text mining technologies to clinical notes at MSK has automated radiation therapy processes, saving clinician time and allowing for risk event analysis and mitigation. He will address the ethical aspects of text mining in healthcare, including patient privacy and responsible data use.

Applications in the Published Literature
Text mining can allow researchers to analyze the vast volume of scientific literature. Dr. Zhiyong Lu (Senior Investigator, NIH/NLM, Deputy Director for Literature Search, NCBI) will showcase his work mining the literature in PubMed, which led to tools including the Best Match algorithm and LitCovid. 

Register now. All registrants will receive a link to the event recording, whether or not they can attend synchronously.

About the speakers:

Dr. Manika Lamba is an Assistant Professor at the School of Library and Information Studies, University of Oklahoma. Previously, she served as a Postdoctoral Research Associate at the HathiTrust Research Center, University of Illinois. Her research broadly falls under computational social science and science of science. She primarily focuses on using computational methods, such as text mining and machine learning, to provide better solutions for information retrieval and organization of digital libraries.

Dr. Anyi Li, Associate Attending Physicist and Chief of Computer Service at the Department of Medical Physics at MSK, leads a talented team comprising mathematicians, physicists, engineers, and data scientists. Together, they collaborate with the Division of Clinical Physics and the Department of Radiation Oncology to harness artificial intelligence, operational research algorithms, and big data. Their objective is to optimize radiation therapy plans, enhance the efficiency of the radiation treatment process from start to finish, develop a data platform for clinical decision support, and improve patient safety by managing accumulated radiation doses. They utilize the latest language models to analyze clinical event timelines and construct workflow knowledge graphs, which improve the radiation therapy workflow and provide valuable insights to the clinical team. With a background as a theoretical nuclear physicist and research scientist tackling NP-hard (nondeterministic polynomial time) problems, Dr. Li transitioned into big data engineering and AI, bringing experience from positions at Yahoo and IBM Watson Health.

Dr. Zhiyong Lu is a tenured Senior Investigator at the NIH/NLM IPR, leading research in biomedical text and image processing, information retrieval, and AI/machine learning. In his role as Deputy Director for Literature Search at NCBI, Dr. Lu oversees the overall R&D efforts to improve literature search and information access in resources like PubMed and LitCovid, which are used by millions worldwide each day. Additionally, Dr. Lu is Adjunct Professor of Computer Science at the University of Illinois Urbana-Champaign (UIUC). With over 400 peer-reviewed publications, Dr. Lu is a highly cited author, and a Fellow of the American College of Medical Informatics (ACMI) and the International Academy of Health Sciences Informatics (IAHSI).