Building a Case for Agent-Assisted Learning as a Catalyst for Curriculum Reform in Medical Education

Erin Shaw, Rajaram Ganeshan, and W. Lewis Johnson
Center for Advanced Research in Technology for Education (CARTE)
Information Sciences Institute, University of Southern California
4676 Admiralty Way, Marina del Rey, CA 90292-6695 USA
{shaw, rajaram, johnson}@isi.edu, http://www.isi.edu/isd/carte/

Douglas Millar
Graduate School of Education and Psychology, Pepperdine University
400 Corporate Pointe Drive, Culver City, CA 90230 USA
dougnhelen@moonlink.net

 

Abstract: Animated pedagogical agents offer promise as a means of making computer-aided learning more engaging and effective. Realizing its full potential will involve integrating agents into curricula on a massive scale. This paper describes progress toward this end: the development of an agent-assisted learning environment designed for widespread use in health science curricula. The system features Adele, an animated pedagogical agent who guides and assesses students as they work through clinical cases. We present the results of an initial evaluation of Adele by twenty-five medical students, and draw several conclusions about computer-based tutoring in a clinical domain. We also describe our experience working with the medical faculty to design and build the clinical case for the evaluation, and propose a process for large-scale case authoring.

Keywords: Animated pedagogical agent, intelligent tutoring, real-world evaluation, simulation-based training

  1. Introduction
  2. Animated pedagogical agent technology has been proposed as a new approach for making computer-based learning more engaging and effective [Johnson 1998]. It builds upon previous work on intelligent tutoring systems [Wenger 1987] and extends it in several important respects. As we view the concept, animated pedagogical agents are a type of autonomous agent [Johnson and Hayes-Roth 1998]: they are capable of pursuing goals in complex environments, adapting their behavior as needed in response to unforeseen events. Their environment is typically an educational simulation, together with the learners and other agents that interact with the simulation. A pedagogical agent may seek to achieve pedagogical goals (e.g., to help a learner to learn about a topic), communicative goals (e.g., to acknowledge a learner's action), and task goals (e.g., to help the learner solve some particular problem). Animated pedagogical agents also have life-like, animated personas. They can respond to learners with a combination of verbal communication and non-verbal gestures such as gaze, pointing, body stance, and head nods. They can convey emotions such as surprise, approval, or disappointment. Taken together these capabilities allow animated pedagogical agents to interact with learners in a manner that is closer to face-to-face collaborative learning.

    The technical sophistication of animated pedagogical agents has progressed rapidly. Steve, a 3D animated agent, can interact with learners in individual and team scenarios [Rickel and Johnson 1997, 1998]. PPP Persona is able to generate tutorial presentations of Web-based learning materials [André et. al 1998]. Cosmo is able to generate critiques and explanations using a combination of speech and emotive gestures [Towns et. al 1998]. Early empirical results show that these agents can enhance the learning experience and improve its effectiveness [Lester et. al 1997].

    In this paper we describe our experience employing Adele, an Agent for Distance Education, who is designed to help realize the potential of animated pedagogical agent technology in on-line education [Shaw et. al 1999]. Adele operates in a Web-based, distributed simulation environment, where she monitors students as they solve problems, gives feedback, points them to relevant on-line reference materials, and evaluates their performance. She is designed to be used in a wide range of health science courses; here, we focus on our work to support clinical skills learning in medical education.

    This paper presents important progress toward widespread adoption of agent-assisted learning techniques. Empirical studies of animated pedagogical agents have shown promising results with school children, but it was not known whether university students and practicing professionals would respond positively as well. We present the results of a formal evaluation of Adele by twenty-five medical students, and draw several conclusions about computer-based tutoring in a clinical domain. We also describe our experience working with the faculty and staff at the Medical School to design and build the clinical case for the evaluation, and propose a process for large-scale case authoring.

  3. Adele system overview
  4. Adele's system, shown in Figure 1, consists of four main components: the pedagogical agent, the simulation, the client and server, and the server store. The pedagogical agent consists further of two sub-components, the animated persona and the reasoning engine. A fifth component, the session manager, is employed when the system is run in multiple-user mode. The central server implements gate-keeping services, maintains a database of student progress, and when appropriate, provides synchronization for collaborative exercises carried out by multiple students on multiple computers.

    The reasoning engine performs all monitoring and decision making. Its decisions are based on the case task plan and an initial state, which are downloaded from the server when a case is chosen, and the agent's current mental state, including the student model,

     

    Figure 1. Architectural overview of Adele's system.

    which is updated as a student works through a case. The simulation can be authored using the language or authoring tool of one's choice. All simulations communicate with the agent via a common application programming interface (API) that supports event and state change notifications as defined by the simulation logic. The two-dimensional animated persona is simply a Java applet that can be used alone with a JavaScript interface or incorporated into a larger application such as the one we describe here.

    2.1 Task representation and reasoning

    Previous efforts in medical intelligent tutoring (e.g., [Clancey 1983] and [Azevedo et. al 1997]) have either been large expert systems incorporating sizeable medical knowledge bases, or have been prototypes focusing on narrow sub-fields of medicine. Neither approach is suitable for a system like Adele, which is designed to apply to variety of health science courses, yet must be downloaded and run on client computers. Adele employs a different approach: she encodes for each case only the knowledge needed to tutor that particular case.

    Decisions about what knowledge to include are made when the case is authored. Adele's formal knowledge consists mainly of the procedural knowledge necessary to work through a case, and is represented as a hierarchical plan [Russell and Norvig 1995]. Related medical knowledge, such as background information about disease etiologies, is incorporated into textual hints and Web-based reference materials, and can thus be presented to the learner as needed, but is not represented formally within Adele's knowledge base. This simplifies the run-time knowledge base and reasoning engine.

    2.2 Opportunistic learning

    Adele supports two kinds of opportunistic learning: 1) quizzes to assess a student's understanding of the material, and 2) pointers to Web-page based reference material on the current subject. These pedagogical opportunities are represented as situations in a situation space [Marsella and Schmidt 1990]. A situation space provides a means of structuring the space of states associated with a domain so that they can be used to guide dynamic behavior. By maintaining an awareness of the current situation the agent can undertake contextually-appropriate interactions with the student.

  5. Case-based medical education
  6. The School of Medicine at the University of Southern California has begun to implement a first and second year curriculum that emphasizes the analysis of clinical cases. 'Clinical case'-based education is, more generally speaking, a case-based approach to instruction [Gragg 1940], one in which patient-physician cases create a familiar context for learning. The move to a case-based approach in medical instruction is in response to the problem of knowledge application, whereby a student may recall learned knowledge but can not apply it in an operational setting [Whitehead 1929]. While not particular to the medical domain, the problem is pervasive within it.

    A clinical case typically consists of a patient's chief complaint, the clinical findings, the differential diagnoses with an emphasis on determining a final diagnosis, and the treatment and management of the disease. Although created for presentation in a classroom setting, the cases are inherently situated and interactive, and with effort, can be reconstructed as simulation-based diagnosis exercises. In this section we explain how Adele guides students through one of these interactive exercises and discuss the benefits and drawbacks of computer-based tutoring in the medical domain.

    Figure 2. Adele explains the importance of palpating the patient's abdomen

    3.1 Multiple-level learning design

    Figure 2 shows a typical case-based diagnosis exercise in which students are presented with a simulated patient in a clinical setting. In the role of physicians, students are able to perform a variety of actions on the simulated patient; they may ask questions about medical history, perform a physical examination, order diagnostic tests, and make diagnoses. Adele monitors the student's actions and provides feedback accordingly. Depending upon the instructional goals, Adele may highlight aspects of the case, suggest correct actions, provide hints and rationales for particular actions, reference relevant background material, and provide contextual assessment.

    A clinical case presents a rich context for learning at many levels. Adele can emphasize the procedure, namely, the best practice approach to a particular diagnosis, or she can concentrate on the related learning materials. Exercises can focus on different aspects of a case: in the medical domain, we may have pre-clinical exercises, which stress fundamental facts; clinical exercises, which stress clinical diagnosis; or resident-level exercises, which stress treatment and management. One case can support all three levels, although the pedagogical emphasis of the learning task changes for each level.

    3.2 Computer-based learning design

    The pedagogical emphasis also changes with respect to the learning content, in order to exploit the benefits, or mitigate the drawbacks, of computer-based learning. For example, discovery by palpation and percussion, two forms of touch, cannot be simulated on a computer. Teaching listening skills is problematic, too, because it is difficult to record high fidelity body sounds without special equipment. The best an application can do is to provide a textual description of what something feels like or sounds like.

    On the other hand, the visual bias of the computer makes it especially good for teaching observation skills, something that is often under-emphasized in clinical settings, where students are apt to miss important visual cues in their rush to complete an examination. We can exploit visual opportunities further: for example, after the simulation response, "The throat is not inflamed.", when examining a patient's throat, Adele will display and compare images that show both normal and inflamed throats.

  7. Case Development
  8. If Adele is to be adopted as part a system-wide curriculum for case-based medical education, the issue of authoring the task representation and simulation logic for each case becomes very important. This section discusses issues in large-scale case authoring, and tools to assist in this process.

    Adele's interactive cases are developed by layering domain, simulation, and pedagogical data. Domain knowledge, such as reusable case and disease data can be combined to produce a best practice task model. This task model specifies an ordering of steps to be taken, places constraints on the steps, and relates the steps to the diagnosis. Rationales and hints for the steps are added here. The amount of case material that is used to build the task model depends on the instructional level and objective. Once the task model is completed, the author supplies the media assets needed for the simulation. Media include text for Adele's responses, and text, image, video and audio files for the patient and clinical findings. The pedagogical model, which consists of contextual comments, reference materials, and quizzes for assessment, is then layered over the resulting task model and simulation.

    4.1 Building an authoring tool

    Currently, case authors enter data for new cases via a form that sits on top of a database. A translator takes the output and parses it into an object-oriented case representation scheme from which both the task representation as well as the simulation logic for the case are generated automatically. The translation process performs a variety of syntactic consistency checks on the input provided by the author and also saves the author the burden of matching simulation events with the actions in the plan. The current tool isn't ideal; authoring a non-sequential path through a case can be confusing, and it does not yet provide for a persistent repository so that data can be easily reused across cases. We are actively designing an authoring tool with these qualities that will meet the needs of both instructors and developers.

    In their paper on authoring via task-specific tools, Bell and Zirkel [1997] argue that many ITS authoring tools sacrifice pedagogical power for content flexibility but that developing an authoring tool for a specialized category of instructional applications has advantages even so. We agree that while targeted authoring tools do indeed sacrifice flexibility, it need not be for pedagogical power. Power is afforded to authors who understand the types of tasks they want to teach, and the constraints that are inherent within those tasks. From our experiences with one-on-one authoring between physicians and developers, we propose that a successful authoring tool must be

    1. Domain-intuitive: Medical professionals understand how clinical cases are constructed and presented, and their knowledge of the domain can be accounted for to help tailor a system for them, rather than one for the programmer, or architecture;
    2. Reusable: Productivity and ease of use are increased for authors who can select from a repository of previously-authored steps and rationales, etc., instead of supplying new information each time. Standard ontologies [19] can also serve as reusable knowledge bases from which knowledge for cases is drawn as needed;
    3. Testable: Case authors need feedback on the cases they author and familiarization with the system they are authoring for will make them better authors. Adele's system may be a good candidate for an authoring tool that utilizes Programming By Demonstration (PBL). Diligent [Angros et. al 1998], is a good example of a PBL system in which users author tasks within the tutoring environment; it has been shown that instructors more accurately do in context, than explain out of context.
  9. Evaluation
  10. In this section we describe the initial evaluation of Adele. The evaluation was given to a class of second year medical students during the third week of November 1998 and was based on a new case on Lung Cancer that the students had studied in class. The students were unmonitored and worked through the case on their own time. Over one hundred students used the system, although only twenty-five of them completed the final questionnaire. Two face-to-face evaluations were also conducted.

    Our goals for the evaluation were twofold: to discover how the students would react to Adele, and to confirm that the system could support the students. Did Adele and the concept of Web-based exercises have potential in a medical school environment? Was the Adele system easy and robust enough to be used in such an asynchronous setting? The final questionnaire contained thirty questions in six categories: system use, system components, rationales, Adele, and learning. They addressed both specific elements of the tutoring system, such as the interface and rationales, and the general reaction of the students to Adele and the concept of the system. Answers were scored on a Likert scale.

    Evaluation question

    Strong disagree %

    Disagree

    %

    Neutral

    %

    Agree

    %

    Strong

    agree %

    The system was easy to use (combined results)

    2

    8

    15

    56

    19

    It was easy to figure out what to do

    4

    24

    20

    44

    8

    It was easy to figure out how to do it

    12

    16

    24

    40

    8

               

    Adele is a good distance education tool

    4

    8

    16

    44

    28

    Adele is useful as a classroom preparatory tool

    4

    17

    8

    38

    33

    Adele would be helpful as a class supplement

    4

    20

    4

    42

    29

    Adele is a good substitute for a class lecture

    29

    33

    21

    9

    9

    Adele provided most info of a lecturer

    12

    0

    64

    24

    0

    Adele provided most info of an attending phys

    8

    8

    56

    26

    0

    Adele is believable as an attending physician

    8

    24

    44

    24

    0

    I would like to have more cases available

    0

    8

    13

    42

    38

               

    Adele's hints are helpful

    0

    4

    33

    21

    42

    Adele's rationales were useful

    0

    4

    29

    34

    34

    I prefer Adele give rationales before a step

    4

    20

    40

    28

    8

    I prefer Adele give rationales after a step

    0

    17

    39

    35

    9

    I prefer Adele let me ask for the rationales

    4

    4

    46

    38

    8

    Adele's images and actions were motivating

    4

    17

    42

    29

    8

    Adele is preferable to a text-only tutor

    12

    32

    24

    16

    16

    I prefer a real voice to a synthesized one

    0

    12

    32

    32

    24

    Adele's unsynchronized lips-voice bother me

    4

    22

    48

    22

    4

     

    5.1 Analysis of the results

    Overwhelmingly, students thought the system was easy to use, yet some found it difficult to figure out what to do next, and how to do it. The latter finding indicates a need for more guidance than the system currently provides. We are looking to Adele to fill this role; to suggest actions at a system level, as opposed to a task level, when a user becomes confused, and to point out the importance of interface elements if they are not utilized. Individual comments ran the gamut, from frustrated students who were unable to use the system to enthusiastic users whose comments sound like testimonials. As researchers and initial users of the systems we develop, we are inclined to bias the usability of the system in favor of users like ourselves, in other words, sophisticated users. Before any system can be successfully deployed on a large scale, this bias must be remedied. If there was a consensus among the students, it was that they wanted more cases to work on. In the words of one participant, "the more practice, the better."

    The majority of students thought Adele would be a good distance education tool, and useful as a classroom preparation tool, but would not suffice as a replacement for a class lecture. They did not find Adele believable as an attending physician, a feeling that conflicted with a statement made by a fourth year student who had actually worked with attending physicians. It is also not clear if students prefer the persona to a text-only tutor. Though the numbers would indicate otherwise, the students provided favorable impressions of Adele in their general textual comments. Not surprisingly, students would have preferred a real voice to a synthesized one. Surprisingly, however, they did not mind that Adele's lips and voice were not synchronized. Adele's lips and voice move at the same time but are not yet phonetically synchronized and some people find this phenomenon disturbing. It may not be as important as we once thought.

    Almost all students agreed that Adele's hints were helpful and her rationales useful. They had mixed feelings, though, about when they wanted to hear the rationales given. We noticed during a one-on-one session that the student was not asking "Why?", and therefore missing much of the authored knowledge, so we decided to have Adele give some of the rationales automatically, whether a student asks for them or not. We implemented three variations: 1) give a rationale only when asked, 2) give it automatically after a hint, and 3) give it automatically after a user takes a step, and then asked the students which variation they preferred. Most students answered that they prefer to hear a rationale only when they ask for it, although our experience suggests that they would not ask at all if given a choice. We continue to explore these issues further.

  11. Conclusion
  12. Adele and other pedagogical agents like her have lessons to offer regarding autonomous agent design. Researchers in believable agents argue that the audience's perception of an agent's competence is more important than the competence itself [Sengers 1998], and that competence is but one of many factors that agent authors should take into account [Reilly 1997]. A key question is to what extent these claims are true for pedagogical agents. User feedback from Adele suggests that while students find her advice and feedback useful, and think she is a good learning supplement, they don't think her knowledge is equal to that of a lecturer's or attending physician's.

    Adele's task representation is being enhanced to explicitly reason with hypotheses and their likelihoods as in GUIDON [Clancey 1983]. This will allow Adele to present rationales in a more flexible manner and also automatically generate quizzes to verify the student's knowledge of the hypotheses underlying their actions. From an authoring perspective, the explicit representation of hypotheses and their relationships to findings could allow for automated generation of rationale without requiring the author to provide the rationale for every diagnostic step in a case. These limitations notwithstanding, the approach of orienting Adele's run-time knowledge base to individual cases appears to be workable, and will continue to be followed as the library of cases is developed.

    We are encouraged by the acceptance of Adele by the medical students and the positive impression she made on the medical faculty and staff, whose cooperation and support was crucial. We will be continuing our efforts in curriculum innovation in the medical domain with a larger and more formal effort beginning in the summer of 1999, which will include the development and evaluation of complete course modules aimed at multiple organ systems and student expertise levels. Meanwhile, we continue to make steps toward the adoption of the technology in other domains. In collaboration with the USC School of Dentistry, we have implemented a case for a field trial in April 1999 and are planning to author more cases for a course in the fall. We are also developing a new non-clinical simulation that will allow Adele to be employed in more general settings.

  13. Acknowledgments
  14. CARTE staff members Kate Labore and Dr. Jeff Rickel, 'A' Team members Ami Adler, Andrew Marshal, Anna Romero, and graduate student Chon Yi all contributed to the work presented here. Dr. Allan Abbott led the evaluation at the medical school and he and medical student Michael Hasler authored the case. Our collaborators, Drs. Demetrio Demetriatis, William La, Wesley Naritoku, Sidney Ontai, and Beverly Wood, and Angela Atencio and Leah Flodin at the USC School of Medicine provided indispensable assistance. This work was supported by an internal research and development grant from the USC Information Sciences Institute.

  15. References

  1. André, E., Rist, T., and Müller, J. (1998). Integrating reactive and scripted behaviors in life-like presentation agents. In K.P. Sycara and M. Wooldridge (Eds.), Proc. of the Second Int'l Conf. on Autonomous Agents, pp. 261-268, ACM Press, New York.
  2. Angros, R., Johnson, W.L., & Rickel, J. (1997). Agents that Learn to Instruct. In Proc. of the 1997 AAAI Fall Symposium on Intelligent Tutoring System Authoring Tools, TR FS-9701.
  3. Azevedo, R., Lajoie, S.P., Desaulniers, M., Fleiszer, D.M., & Bret, P.M. (1997). RadTutor: The theoretical and empirical basis for the design of a mammography interpretation tutor. In Proc. of the 8th World Conf. on Artificial Intelligence and Education, IOS Press, Amsterdam.
  4. Bell, B., and Zirkel, J. (1997), Authoring virtual gallery environments via task-specific tools. In Proc. of the 1997 AAAI Fall Symposium on Intelligent Tutoring Systems Authoring Tools, TR FS-97-01.
  5. Clancey, W. J. (1983). The Epistemology of a Rule-Based Expert System: A Framework for Explanation. Journal of Artificial Intelligence, vol. 3, pp. 215-251.
  6. Clancey, W. (1987). Knowledge-based Tutoring: The GUIDON Program. MIT Press.
  7. Gragg, C.I. (1940). Because wisdom can't be told. Harvard Alumni Bulletin, October 19, pp. 78-85.
  8. Johnson, W.L. (1998). Pedagogical agents. In Proceedings of ICCE'98, vol. 1, pp. 13-22. China Higher Education Press and Springer-Verlag, Beijing.
  9. Johnson, W.L. and Hayes-Roth, B. (1998). The First Autonomous Agents Conference. The Knowledge Engineering Review 13(2), pp. 1-6.
  10. Lester, J.C., Converse, S., Stone, B., Kahler, S., and Barlow, T. (1997). Animated pedagogical agents and problem-solving effectiveness: A large-scale empirical evaluation. In Proc. of the Eighth World Conference on Artificial Intelligence in Education, pp. 23-30. IOS Press, Amsterdam.
  11. Marsella, S.C. and Schmidt, C.F. (1990). Reactive planning using a situation space. In Proc. of the 1990 AAAI Spring Symposium Workshop on Planning.
  12. Reilly, W.S.N. (1997). A methodology for building believable social agents. In W.L. Johnson and B. Hayes-Roth (Eds.), Proc. of the First Int'l Conference on Autonomous Agents, pp. 114-121. ACM Press, New York.
  13. Rickel, J. and Johnson, W.L. (1998). Animated agents for procedural training in virtual reality: perception, cognition, and motor control. To appear in Applied Artificial Intelligence Journal.
  14. Rickel, J. and Johnson, W.L. (1997). Intelligent Tutoring in Virtual Reality: A Preliminary Report. In Proc. of the Int'l Conf. on Artificial Intelligence in Education, pp. 294-301 IOS Press, Amsterdam.
  15. Russell, S., and Norvig, P. (1995) Artificial Intelligence: A Modern Approach. Prentice Hall, Englewood Cliffs.
  16. Sengers, P., (1998). Do the Thing Right: An architecture for action-expression. In K.P. Sycara and M. Wooldridge (Eds.), Proc. of the Second Int'l Conf. on Autonomous Agents, pp. 24-31. ACM Press, NY.
  17. Shaw, E., Johnson, W.L., and Ganeshan, R. (1999). Pedagogical agents on the web. To appear in Proc. of the Third Int'l Conference on Autonomous Agents. ACM Press, New York.
  18. Towns, S.G., Callaway, C.B., Voerman, J.L., and Lester, J.C. (1998) Coherent gestures, locomotion, and speech in life-like pedagogical agents. IUI '98: 1998 International Conference on Intelligent User Interfaces, pp. 13-20, ACM Press, New York.
  19. U.S. Department of Health and Human Services, National Institutes of Health and National Library of Medicine (1998). Unified Medical Language System Knowledge Sources.
  20. Wenger, E. (1987). Artificial intelligence and tutoring systems: Computational and cognitive approaches to the communication of knowledge. Los Altos, CA: Morgan Kaufmann Publishers, Inc.
  21. Whitehead, A.N. (1929) The Aims of Education. Cambridge University.