Schema: A Quantified Learning Solution to Augment, Assess, and Analyze Learning in Medicine
Deepu Sebin, Vishwin Doda, Skanthavelan Balamani

TL;DR
Schema is an e-learning app for medical students that uses quantified learning to provide personalized, self-directed learning and progress tracking.
Contribution
Schema introduces a novel app-based e-learning solution for medical education with integrated quantified learning and feedback mechanisms.
Findings
Schema organizes MCQs into granular subtopics and provides visual feedback to improve learning outcomes.
The app was made available in 2022 to support medical students in self-directed and competency-based learning.
Schema aims to offer insights into how medical students learn using technology-driven methods.
Abstract
Quantified learning is the use of digital technologies, such as mobile applications, cloud-based analytics, machine learning algorithms, and real-time performance tracking systems, to deliver more granular, personalized, and measurable educational experiences and outcomes. These principles, along with horizontal and vertical integrative learning, form the basis of modern learning methods. As we witness a global shift from traditional learning to competency-based education, educators agree that there is a need to promote quantified learning. The increased accessibility of technology in educational institutions has allowed unprecedented innovation in learning. The convergence of mobile computing, cloud computing, and Web 2.0 tools has made such models more practical. Despite this, little has been achieved in medical education, where quantified learning and technology aids are limited to a…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProblem and Project Based Learning · Innovations in Medical Education · Learning Styles and Cognitive Differences
Introduction
According to the WHO, more than 90,000 students were admitted to medical colleges in India, and 1.36 million worldwide in 2022. These students undertake the task of learning the nuances of medicine and transforming into competent medical professionals over the course of their undergraduate education. The students from Generation Z (born between 1997 and 2012) are the first cohort to have had access to modern technology during their undergraduate years, with ready access to the Internet. As a result, they are open to new learning techniques and technology and expect a personalized approach to medical learning [1]. E-learning, virtual learning environments, and simulated clinical experiences have inspired the need for a quantified, self-directed, and integrated approach to learning and assessing competencies in medicine. With changing paradigms, the learning systems in medical education must also evolve to cater efficiently and holistically to this new generation of learners. The opportunity to do so is evident in the fact that most medical students report using mobile devices to aid their learning in domiciliary and clinical settings more than ever before [2]. While methodologies such as problem-based learning (PBL) and team-based learning (TBL) have been incorporated into medical curricula since the 1970s, many institutions, particularly in developing regions, continue to rely heavily on subject-specific reading and subjective evaluations. However, a global trend toward competency-based education is accelerating the shift toward more integrated, technology-aided, and objective assessment models. Perhaps the single most crucial shift in modern-day medical education is the growing role of mobile apps, which have been shown to aid learning and improve performance in solving clinical problems [3-5]. This innovation report aims to describe Schema, an app-based solution, along with its development, dynamics, potential use cases, and scope. It also explores the application of Schema to recognize en masse trends in the competency of medical students in various topics of the curriculum by observing the percentage correctness of the responses to each multiple choice question (MCQ).
Technical report
Approach
Schema is a first-of-its-kind learning tool designed to promote self-directed, integrated, and quantified learning of high-yield subtopics with active feedback to the learner. This innovative approach has refined and granularized the core undergraduate medical curriculum into 1,947 "schemas" or sub-topics, which are easily navigable through our user interface. The development of Schema was driven by the need to categorize a set of 16,000 preclinical, paraclinical, and clinical Single Best Answer Multiple Choice Questions (SOC-MCQs) from 19 medical subjects, including Anatomy, Physiology, Biochemistry, Microbiology, Pathology, Pharmacology, Forensic Medicine, Ophthalmology, Otorhinolaryngology, Community Medicine, Medicine, Surgery, Obstetrics and Gynaecology, Pediatrics, Dermatology, Psychiatry, Radiology, Anaesthesiology, and Orthopedics into a more granular format for easier identification, navigation, and integration.
An ideal integrative system for quantified learning should satisfy certain parameters. It should granularize the medical curriculum into highly specific subtopics and be subject-agnostic to allow for seamless horizontal and vertical integration. Additionally, it should be easily navigable to facilitate access and have a definite organizational structure. The system should also be capable of providing feedback on the extent to which a learner has completed the curriculum and offer positive feedback regarding the performance of learners in each topic. Importantly, it should highlight high-yield topics so that the learner can maximize the benefits from their studies and not act as a shortcut to completing studies but rather as an augmentation to the learner’s efforts for efficient learning.
Curricular organization, integration, and alignment
Unlike the traditional models that organize the curriculum into a subject-specific hierarchy, we consolidated MCQs according to Schema subtopics, irrespective of the subject they would traditionally be categorized in. In other words, Schema is subject agnostic. Every MCQ is linked to all Schema subtopics related to it, and each Schema subtopic is linked to relevant MCQs that fall within its scope, in a non-exclusive manner as shown in Figure 1. This approach is novel in the domain of learning, with the resulting outcome being seamless vertical and horizontal integration while maintaining some boundaries within the content, thus removing redundancy and providing interconnectedness. This quasi-segregated model of curricular organization enables the use of Schema for a targeted approach in teaching, self-directed learning, assessment, and pinpointing knowledge gaps. Organizing the MCQs into their respective Schema subtopics was a challenging task. First, all the subtopics that each MCQ could be linked to were listed. The list of such subtopics was then compiled to create a central database containing all subtopics representing a granular subtopic independent of a subject or chapter. Some examples of these subtopics are “Ivermectin,” “Anaphylaxis,” and “Kernicterus.” Some subtopics were granularized into more specific subtopics, often when different aspects of a topic, disease, or condition were. For example, the subtopic Diabetes Mellitus was granularized into “Diabetes Mellitus: Pathophysiology and Clinical Features,” “Diabetes Mellitus: Investigations,” “Diabetes Mellitus: Treatment,” and “Diabetes Mellitus: Macrovascular and Non-vascular Complications.” It is to be noted that a significant number of MCQs were linked to more than one Schema topic, resulting in nesting integration across horizontal and vertical domains. At the end of this exercise, a total of 3010 such subtopics were identified, which nearly encompassed the entire medical curriculum in a granular format.
Systems of curricular organization: (A) traditional, (B) Schema model of integrative organization.MCQ: Multiple choice question.
The next exercise was to manually identify and label high-yield subtopics based on their cruciality in the clinical practice of a physician, and the changing paradigms of globally relevant diseases. A total of 1,947 such high-yield topics were recognized and compiled to form the Schema list of high-yield subtopics. As opposed to automated approaches such as text-mining, a manual approach was selected for identification and labeling of subtopics. Text-mining-based approaches may have several known drawbacks such as artifacts, lack of contextual understanding, and inability to consider semantic ambiguity or variability in terminology. Technology-based or automated detection of such overlaps has also been previously attempted using text mining techniques and has been shown to have several drawbacks [6-7]. The manual approach of categorizing each of the thousands of MCQs and analyzing overlaps is unique, avoids these drawbacks, and offers the additional advantage of a more insight-based identification of relevant subtopic tags. The manual approach helped us map overlaps within the curriculum of the same disease, anatomical structure, or conditions assessed under different subjects due to the different approaches practiced under different specializations, or are assessed on different occasions in the teaching timeline resulting in duplication of effort and reduced integrated learning.
The Schema interface
The interface of Schema serves the integral function of aiding navigation within the Schema by visually bringing together the extensive Schema database and presenting it to the learner in a user-friendly manner. The Schema interface serves the following functions:
Intuitively Navigate, Sort, and Analyze the Schema List of Topics
In order to make the navigation within the Schema intuitive for learners, a simple yet functional interface was desirable. It must provide the learner with intuitive navigation through Schema. The alphabetical sorting bar allows intuitive navigation to a specific Schema entry. The list of Schema topics can also be sorted alphabetically, from most to least completed and vice versa and from strongest to weakest and vice versa as shown in Figure 2A. This allows the learner to analyze their progress, focus efforts to cover the curriculum, focus on areas that need improvement, and further improve their strengths. The filter menu allows the learner to focus on Schema topics in which MCQs from that particular subject occur, as shown in Figure 2B.
Schema list interface depicting functions and feedback.
Tapping on a Schema title reveals the list of MCQs linked to it, organized by the module they originally covered and the date the module was last attempted. An option to bookmark any MCQ that the learner finds important or difficult is also provided, which facilitates review.
Provide Feedback About the Learner’s Progress and Performance
The Schema interface contains several elements that provide users with feedback on their progress and performance in both visual and quantified forms.
In the Schema list interface, these elements include:
Blue ticks: A blue tick conveys to the learner that all MCQs in a Schema topic have been attempted. Collectively, blue ticks allow the learner to perceive progress and the extent of curricular coverage.
Red titles: Red titles indicate a low average percent score (<50%) in a Schema topic. This provides the learner with feedback on which Schema topics need improvement.
Within an individual Schema topic, feedback elements include:
Solved MCQ count: It quantifies the number of MCQs solved by the learner out of the total MCQs in a given Schema topic. Furthermore, it quantifies the number of correct, wrong, and skipped MCQs, providing the learner with insight regarding their overall performance in a particular Schema topic.
Score: The percentage of MCQs in a Schema answered correctly by the learner out of the total number of MCQs attempted. A pie chart accompanies this to provide a visual representation. A score less than 50% is considered low and is denoted by a red title in the Schema list as shown above.
Green ticks and red crosses: Represent MCQs answered correctly and incorrectly, respectively. The learner can also filter either of these out with the help of the filter menu as shown in Figure 3.
Schema showing the user's performance and granular feedback.
Analysis of Performance and Trends in Medical Learning
It is possible to identify trends in the learning and performance of students by analyzing the pooled data of all learner groups on specific Schema topics. This can help locate learning gaps and assist in their closure through appropriate interventions. It can also offer insights into the necessity of focusing on specific competencies and changes in the learning landscape with changing illness burdens over time. We further explored this hypothesis through an in-depth retrospective analysis of anonymized Schema data. The granulated curricular organization of Schema, along with its capability to analyze pooled data, has the potential to observe, analyze, and augment the learning of cohorts of learners and the medical student community as a whole.
Unlike many learning tools and commercial MedEdTech platforms that primarily serve as content repositories or static test-preparation aids, Schema is designed as a dynamic, learner-centered ecosystem. It enables users to self-direct active learning, receive continuous actionable feedback, and engage with clinical reasoning through structured MCQs that test critical thinking in clinical scenarios and aids clinical decision-making skills and acumen. In this regard, Schema is a holistically designed system that combines the principles of quantified learning, technology-enhanced learning (TEL), longitudinal learning, self-directed learning (SDL), PBL, simulation-based learning (SBL), feedback-directed learning, and integration across domains [8-10]. These modalities have been shown to outperform traditional teaching and learning models [11]. Furthermore, unlike traditional classroom learning, it allows learners to tailor their learning according to their unique requirements, directions, and personal competencies, thus creating a personalized learning experience. This holistic approach allows Schema to be integrated into the current undergraduate learning and assessment models to improve outcomes and quantify competencies.
SOC-MCQs
Among the various components that constitute the Schema learning model, SOC-MCQs serve as a foundational tool for both learning and assessment. While SOC-MCQs are well-established for evaluating conceptual understanding and clinical reasoning, they inherently focus on structured problem-solving and may not fully capture communication skills, procedural proficiency, or nuanced decision-making. However, when embedded within a quantified, feedback-rich system like Schema, their effectiveness is significantly enhanced. Each MCQ in Schema is curated by a team of practicing doctors and is paired with detailed explanations that appear immediately after the learner submits a response. This pairing transforms SOC-MCQs from mere assessment tools into active learning stimuli that reinforce concepts and promote deeper cognitive engagement. Although SOC-MCQs alone have known limitations, their integration within Schema’s broader framework of feedback-driven, topic-wise progression and real-time performance tracking makes them a powerful tool for fostering self-directed, reflective learning [12].
Participants
We analyzed the anonymized data for 20 Schema topics, and the MCQs linked to them from 2018 to 2022 were shortlisted based on the relevance of knowledge in clinical practice. These topics included Anaphylaxis, ST-elevation myocardial infarction (STEMI) (STEMI), Covid-19 Vaccination, Glasgow Coma Scale, Subdural Hemorrhage, Syndromic Management of STIs, Laryngeal Mask Airway, BLS (Basic Life Support), Ischemic Stroke - Management, Ischemic Stroke - Risk Factors, Extradural Hemorrhage, NSTEMI, Acetaminophen, Management of Asthma, Diabetic Ketoacidosis (DKA), Hypertension - Management, Subarachnoid Hemorrhage, Vaccine Vial Monitoring, Vaccine & Drug Storage, and Community-Acquired Pneumonia - Management. The number of learners to attempt each MCQ ranged from 60,080 to 206,672.
Outcomes
A total of over 16,000 SOC-MCQs, spanning 19 core medical subjects, were non-exclusively mapped to 1,947 high-yield, subject-agnostic subtopics, which we refer to as Schema topics. Each MCQ could be linked to one or multiple Schema topics based on thematic and conceptual relevance. The number of MCQs assigned per topic varied between 1 and 47, depending on the complexity and curricular weightage of the subtopic. These topics were consolidated into a structured Schema interface that allowed users to intuitively navigate, filter, and track their progress across this granular curriculum. The interface provided real-time visual and quantitative feedback on topic-wise completion, accuracy, and performance trends. Schema was deployed as part of a mobile application available on both Android and iOS platforms and has been accessed by over 300,000 undergraduate medical students across the Indian subcontinent. Schema topics that showed statistically significant improvement, shown as an increase in percentage correctness over time, were primarily related to medical emergencies, including anaphylaxis, STEMI, Glasgow Coma Scale (36.73% improvement), acute subdural hemorrhage (5.29%), laryngeal mask airway devices (8.31%), and basic life support (BLS) (13.97%). This was determined through a retrospective, observational study design using anonymized performance data collected from users of the Schema app between 2018 and 2022.
The primary hypothesis was that Schema, as a quantified learning system, enables progressive improvement in learner competence over time for clinically important topics. For each Schema topic, we compared the average percent correctness of MCQ responses between two cohorts: an early cohort (2018-2019) and a recent cohort (2021-2022). An independent two-sample t-test was used to evaluate differences in mean performance across these cohorts. Statistical significance was considered at p < 0.001. The dataset comprised aggregated performance metrics from between 60,080 to 206,672 learner attempts per topic, which were obtained from the Schema platform’s backend analytics. The results support the hypothesis that quantified learning with timely feedback and topic-wise performance tracking can lead to measurable gains in competency, particularly in high-yield, time-sensitive clinical areas.
To assess Schema's capability for monitoring and improving competencies, we conducted a retrospective cohort analysis of anonymized performance data between 2018 and 2022. This analysis focused on 20 clinically relevant Schema topics. We observed that eight topics demonstrated statistically significant improvements in average MCQ correctness over time, as determined by an independent two-sample t-test (p < 0.001).
Discussion
This innovation report describes Schema, a holistically designed, quantified e-learning system for undergraduate medical education. The retrospective analysis referenced in this report involved pooled, anonymized performance data from over 300,000 medical students across a four-year period (2018-2022), focusing on 20 high-yield Schema topics. It examined changes in learner competence, measured by percent correctness of SOC-MCQ responses, across different time cohorts. To our knowledge, this is the first large-scale attempt to collectively quantify medical learning outcomes across specific clinical competencies in real-world settings. The findings illustrate potential use cases of Schema in tracking evolving competency trends, identifying knowledge gaps, and informing both individual learning paths and broader curriculum development strategies. Similar trend analyses on a larger scale hold the potential to provide insights into the changing paradigms of competence and competencies over time, which in turn can guide targeted efforts toward the improvement of teaching-learning outcomes, both at an individual and an institutional level. On an individual level, Schema allows each learner to quantify their learning, assess their knowledge, learn actively, improve their competence with the help of active feedback, and practice clinical decision-making skills by practicing clinical scenarios. It acts as a seamless and holistic integration of the first principles of modern learning techniques such as self-directed learning, active recall, spaced repetition, feedback-based improvement, and curriculum integration, and in doing so, innovates the undergraduate learning landscape.
Technology has long been known to have the potential to transform learning [13]. Quantification has been the focus of many efforts in higher education and has sparked debate regarding the right approach to measuring outcomes in education [14]. Medicine remains a domain where true quantification of learning, assessment, and outcomes has yet to be explored, likely due to the many challenges unique to medicine, such as the vast curriculum, overlapping knowledge areas, complex learning patterns, and regulations [13].
Regarding the impact of Schema as a tool for self-directed learning, assessment, and improvement in medical education, evidence-based observations need to be collected using various applicable research methodologies. Most notably, it needs to be explored in light of the metrics provided by Messik in 1984, as well as the five markers of validity jointly prescribed by the American Educational Research Association (AERA), the American Psychological Association (APA), and the National Council on Measurement in Education (NCME) [15,16]. Avenues for integrating Schema into currently existing medical education models, consequences of such integrations, predictive models, and potential impacts of Schema need to be determined and aligned to the rapidly changing medical education landscape [17]. The impact itself may reach beyond learning outcomes. Various studies have remarked on the profound impact of better physician knowledge on patient outcomes [18-20]. Furthermore, Schema can be expanded to other levels of medical education, such as specialized and superspecialized courses, where granularization of the curriculum can be useful for in-depth assessment.
Conclusions
Schema showcases vast potential to augment and disrupt the traditional ways we learn medicine. This first report paper demonstrates the different aspects of Schema and its applications. As we progress with the subsequent phases of understanding and developing Schema and its integrations, we plan to undertake further in-depth analysis and derive validity evidence. With this innovative solution, we aim to quantify learning, assessments, and improvements in academic outcomes for medical students and, by extension, hope to improve doctor-patient outcomes with a positive overarching impact on healthcare.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Is medical education ready for generation Z?J Grad Med Educ Eckleberry-Hunt J Lick D Hunt R 378381102018 https://pubmed.ncbi.nlm.nih.gov/30154963/3015496310.4300/JGME-D-18-00466.1PMC 6108364 · doi ↗ · pubmed ↗
- 2'It's on my i Phone': attitudes to the use of mobile computing devices in medical education, a mixed-methods study BMJ Open Wallace S Clark M White J 10992201210.1136/bmjopen-2012-001099 PMC 343283822923627 · doi ↗ · pubmed ↗
- 3Mobile learning in medicine: an evaluation of attitudes and behaviours of medical students BMC Med Educ Chase TJ Julius A Chandan JS 1521820182994557910.1186/s 12909-018-1264-5PMC 6020287 · doi ↗ · pubmed ↗
- 4Mobile applications in medical education: a systematic review and meta-analysis P Lo S One Chandran VP Balakrishnan A Rashid M 017202210.1371/journal.pone.0265927 PMC 894701835324994 · doi ↗ · pubmed ↗
- 5What is the impact of Apps in medical education? A study of CAPSULE, a case-based learning App BMJ Simul Technol Enhanc Learn Sadler J Wright J Vincent T Kurka T Howlett D 2932967202110.1136/bmjstel-2020-000593 PMC 893666235515743 · doi ↗ · pubmed ↗
- 6Pitfalls in users’ evaluation of algorithms for text-based similarity detection in medical education 2018 Federated Conference on Computer Science and Information Systems ŠčavnickýJ Karolyi M RůžičkováP Komenda M 109116 Belgrade, Serbia Fed CIS 152018
- 7Finding overlapping terms in medical and health care curriculum using text mining methods: rehabilitation representation - a proof of concept Mefanet J Karolyi M Komenda M JanoušováR Víta M Schwarz D 717742016 http://mj.mefanet.cz/mj-20161227
- 8The impact of a longitudinal curriculum on medical student obstetrics and gynecology clinical training Hawaii J Med Public Health Melo J Kaneshiro B Kellett L Hiraoka M 144147732014 https://pubmed.ncbi.nlm.nih.gov/24843837/24843837 PMC 4021731 · pubmed ↗
