
What Can We Expect From Machine Learning Predictions in Daily Clinical Neurology?
Key Takeaways
- Risk calculators must pair discrimination with interpretability, transparency, and sub-minute execution to earn trust and fit high-throughput neurology clinics.
- DAAE-M combines baseline disease status and treatment exposure to estimate 5-year probabilities of secondary progressive transition or high disability driven by progression independent of relapse.
Machine learning will only transform clinical neurology if predictive accuracy is matched by usability, transparency, and active physician involvement in the design and implementation of tools that support patient care.
We have entered the age of machine learning in neurologic research, with thousands of research articles in the last 2 years (openalex.org). So how soon can we expect new machine learning algorithms to reach clinical practice, and what can we expect in the future? It is certainly likely that machine-learning tools and algorithms will benefit our abilities to predict future outcomes.
Knowing the relative risk of serious neurologic events enable more nuanced personalized patient care and engender patient-physician dialogues about desired goals in relation to known risks. Although the future is always somewhat uncertain, machine learning can refine relative risk estimates – much the way a weather report tells us the risk of rain. Neurology is no stranger to the implementation of risk-based predictive tools. The release of the original CHADS-VASc calculator dates back to 2010.1 This tool, used to stratify patients by risk of stroke and thromboembolism, is easy to use and transparent in its calculations. Similar tools have also emerged recently, such as the SeLECT score2 for determining risk of seizure after ischemic stroke. Such tools will become even more accurate and consistent in their risk estimates as machine learning is incorporated into their development.
We recently developed the DAAE-M score3 as one example of how machine learning can be translated into clinically usable tools. The model estimates 5-year risk of disease progression in people with multiple sclerosis (MS) by combining information on disease status and treatment exposure. It predicts outcomes including transition to secondary progressive disease or progression to high disability with progression independent of relapse4–6—events that are clinically meaningful because they often mark a shift toward more rapid and less treatment-responsive deterioration.7,8
The DAAE-M score is freely available here:
However, the development of this tool also highlighted a broader lesson: accuracy alone is not sufficient for clinical adoption. From the outset, we prioritized not only accurate predictive performance but also interpretability, transparency, accessibility, usability, and speed. The final tool can be used in under 30 seconds and produces outputs that can be readily understood in a clinical conversation. This design philosophy reflects a more general requirement for machine learning in medicine—tools must fit clinical workflows if they are to be used at all.
In practice, the value of such tools may lie less in the prediction itself and more in how that prediction is used. Risk estimates can serve as a starting point for discussion, helping physicians and patients navigate decisions during key moments. Importantly, these tools are not prescriptive. Like a weather forecast, they provide probabilistic information that may influence decisions differently depending on the individual patient and clinical context. We view tools like the DAAE-M score as a source of free information. The reports themselves are another tool in the toolbelt, to be wielded when helpful.
One of the most striking observations from this work was the disconnect between methodological innovation and real-world usability. Machine learning is widely applied in research settings, but far fewer efforts are directed toward external validation, accessibility, or the end-user (the doctors) experience. As a result, many promising models remain impractical for daily use. Addressing this gap requires a shift in how such tools are developed—moving from a technology-centered approach toward one that is grounded in clinical realities. In our case, this meant involving practicing physicians throughout the development process. Their input shaped not only which outcomes were prioritized, but also how the tool presents information and how quickly it can be applied in practice. This type of collaboration led us to approach the project not just as researchers, but also as designers and software developers, an approach that may become increasingly important as machine learning moves closer to the bedside – and one requiring that physicians have a seat at the table in the design process.
Looking ahead, the integration of predictive tools into clinical care is likely to be iterative. As physicians begin to use more of these tools, expectations will evolve, and new needs will emerge. This feedback loop—between tool use and tool development—has the potential to drive more meaningful innovation than purely methodological advances. In this sense, the future of machine learning in neurology will not be defined solely by algorithmic improvements, but by how well these tools adapt to the needs of clinical practice. In this process, future research will need to proceed along two complementary lines. First, continued validation across diverse populations is essential to ensure that predictions remain reliable and generalizable. Second, there is a need to expand the range of clinically relevant outcomes that can be predicted. In MS, for example, this may include milestones such as high disability levels or significant cognitive dysfunction—outcomes that are relevant to patients but remain difficult to anticipate.
However, successful implementation of machine learning in neurology will depend heavily on the engagement of clinicians themselves. Physicians are uniquely positioned to identify the barriers to adoption: What makes a tool trustworthy? What would make it usable in a busy clinic? What information is actually needed to support decision-making or dialogues with patients? These questions should not be addressed after a model is built, but at the earliest stages of development. Ensuring that clinicians have a voice in this process will be critical to bridging the gap between innovation and impact. We therefore implore physicians to take part whenever possible. Share your ideas. Write directly to researchers. There are impressive efforts in the MS community, such as MSBase,9 the Czech National MS Registry,10 or the New York State MS consortium,11 for collaborating with clinical data across the world to improve the impact and generalizability of clinical research. If possible, take part or send your thoughts to these organizations about what you believe should be prioritized. There is an exciting future for machine learning in neurology and if we work together as a community, we can create the best tools to fit the needs of our evolving clinical practice.














