What Specifically Do We Think Students In The Quantitative Aspects Of The New Biology Should Know?

Isaac S. Kohane

ChildrenÕs Hospital Informatics Program

Division of Health Sciences and Technology, Harvard University and MIT

Harvard Partners Center for Genetics and Genomics

Motivation

The continuing debate about what constitutes the set of core competencies at the intersection of biology, computing, and medicine tends towards sweeping recommendations open to considerable interpretation. Fellows of the American College of Medical Informatics were asked to specify questions they expected their students and trainees to be able to answer. Review of the contents and categories of these questions suggests the immense breadth of the multidisciplinary educational challenge entailed. It also vividly illustrates the considerable expertise expected in each of the contributing disciplines. Finally, understanding the questions that the many constituencies expect our students to answer sharpens the decision of the tradeoffs to be made in the limited window of the formal education of our students.

Introduction

Discussions of what constitutes the discipline of bioinformatics, clinical informatics, computational biology or biomedical informatics are by their unbounded nature doomed to an inconclusive and unsatisfying outcome. Furthermore, these discussions only have the most abstract relationship to what research or educational programs are implemented by investigators. Specifications of curricula for trainees in these disciplines (which for pragmatic reasons we will denote in their entirety as biomedical computing solely within the confines of this paper and without making any claims for the appropriateness of this term) are helpful but they leave much open for interpretation and therefore a large, and perhaps desirable latitude, for specification of minimal competencies of biomedical computing students/trainees.

It was with this background that I had the pleasure of an informal discussion with one of the luminaries in biomedical computing regarding our shared and differing views of the necessary competencies that we expected of our graduate students and post-doctoral fellows. As we trod along well-worn pathways of countless similar discussions, the following question occurred:

ÒWhat question in a test would you expect/want 70% of the graduates of our training programs to be able to answer?Ó

I subsequently posed the same question to the electronic mail list of the American College of Medical Informatics[1], a nominated and elected group of nationally and internationally recognized investigators in biomedical computing. Within a week I received several replies for a total of 37 questions, which form the core of this manuscript. The purpose here is neither to claim that these are a definitive or representative set nor to be prescriptive in specifying necessary competencies. Rather the intent is to use these questions as a convenience sample on which to base further discussions for the very concrete decisions on the content of class work, seminars and other training mechanisms for the future leaders in biomedical computing.

The organization of the remainder of the paper is:

1)   Review and categorization of the questions

2)   Verbatim copies of the questions

3)   Discussion of some of the implications of this convenience sample.

As the purpose here is to promote further discussion the readers might themselves the following questions:

Would I expect my student/trainee to know the answer to the question?

If not, would I expect her to understand the question and the explicit or implicit challenges it refers to?

If I do not expect the student to answer the question, should she know which expert to ask and how best to pose the question to that expert?

Does this question relate at all to what I understand as biomedical computing?

Of course, the reader can also ask the same questions of himself or herself as they ask of their students.

1.1     Categorization

Post-hoc, I have created a small taxonomy for these questions. Again, this taxonomy is not intended to be normative or prescriptive and but rather only as a useful set of abstractions for categorizing these particular questions.

Category

Elaboration

Number of questions

Clinical application

Used in clinical care or research

22

Biology application

Used in basic biology research

21

Knowledge and/or Data Representation

Tradeoffs and optimization involved in representing data and knowledge

8

Probability

Applications of probability theory, hypothesis testing and statistics.

5

Databases

Design, implementation and performance of instances of databases.

7

Management

Working with institutions and people to engineer information and social systems.

3

Standardization

Use of shared terminologies, data models and ontologies.

3

Science Community

How to operate knowledgeably and effectively and pleasantly in the larger academic community.

3

Decision Support

Improving decision-making, avoiding mistakes/errors.

4

Translational research

Bringing knowledge gleaned in basic research to clinical relevance.

1

User Interface/visualization

Interactions with software artifacts and large data sets from the user perspective.

1

Algorithms/Machine learning

Development, optimization and testing of algorithms. Clustering, and classification and other data-driven characterizations of systems.

7

Public Policy

The way in which national and international policies affect biology, medicine and computational methods.

1

Models

Useful abbreviated descriptions of a system that can be used for diagnosis/fault analysis and/or prediction/prognosis.

3

1.2     The questions

The questions returned to me by the members of the American College of Medical Informatics are reproduced verbatim here with correction of only spelling. The terminology used in these questions varies and is not identical across questions but that is part of the inherent challenge of understanding these questions developed by experts.

1.3     Discussion

This small sample of questions confirms, if there was any doubt, the breadth of competencies expected by various scientific constituencies of students in biomedical computing.  It ranges from clinical applications to those in basic biology, and from probability theory to management pragmatics. These questions ask of the student the judgment to manage top-level decision-makers to effect technology diffusion in hide-bound healthcare systems and the knowledge and mathematical skills to understand the limitations of microarray normalization techniques and those of evolutionary models. These questions also presuppose a broad and deep system engineering and knowledge representation experience with the ability to readily program in a variety of computer languages.

Is this a reasonable set of expectations? It could be argued that for society to successfully advance our understanding of basic biology and how to translate it into improved national and international health, this full range of expertise is necessary (1) (2). For example, to translate clues from evolutionary biology into a new genomic diagnostic (3) and then to incorporate this diagnostic into routine clinical workflow (4) touches on a similarly broad set of scientific and engineering challenges. The question then arises, in this era of increasing multidisciplinary expertise: Is it necessary for one individual to have substantive knowledge of all these areas? (5) Or is it only necessary for the individual to work successfully as part of a team of investigators each with their own expertise? (6) If the former path is chosen then it may only require incremental changes to existing educational programs with introductory or survey courses in the to provide recognition of the nature of the complementary sets of expertise. Perhaps because of the natural inertia of our educational systems, or insight into the goals of our students, this is the path that has been taken by most of the current programs and the new educational programs in bioinformatics, system biology or computational biology. However, if the latter path is taken, then this will require a radical rethinking of our entire undergraduate and graduate curricula. It would require both the devising of several new courses and the mandating of the adoption of in-depth existing specialized classes in several disparate disciplines. This in turn would engender a number of Òzero-sumÓ considerations of tradeoffs in the investment of the students or fellowsÕ limited years of formal education, and investment on the part of our colleges and universities in faculty development and faculty time in these new courses.

Inspection of existing or recommended curricular content in biomedical informatics suggests that some have made the decision that specialization within one segment of the spectrum of competencies touched upon by the above questions is preferable and/or practical or most relevant to a specific constituency (7) (8).  Yet at least some programs have attempted to integrate the full breadth of biomedical computing expertise touched upon by the questions list above (9). Whether they have been successful or whether the goal is appropriate in the first instance (10) (11) will certainly be the subject of much further discussion.

References

1.      Gurwitz D, Weizman A, Rehavi M. Education: Teaching pharmacogenomics to prepare future physicians and researchers for personalized medicine. Trends Pharmacol Sci 2003;24(3):122-5.

2.      Ford JH, 2nd, Turner A, Yoshii A. Information requirements of genomics researchers from the patient clinical record. J Healthc Inf Manag 2002;16(4):56-61.

3.      Eichenbaum-Voline S, Olivier M, Jones EL, Naoumova RP, Jones B, Gau B, et al. Linkage and association between distinct variants of the APOA1/C3/A4/A5 gene cluster and familial combined hyperlipidemia. Arterioscler Thromb Vasc Biol 2004;24(1):167-74.

4.      Kaushal R, Shojania KG, Bates DW. Effects of computerized physician order entry and clinical decision support systems on medication safety: a systematic review. Arch Intern Med 2003;163(12):1409-16.

5.      Piko BF, Stempsey WE. Physicians of the future: Renaissance of polymaths? J R Soc Health 2002;122(4):233-7.

6.      Vena JE, Weiner JM. Innovative multidisciplinary research in environmental epidemiology: the challenges and needs. Int J Occup Med Environ Health 1999;12(4):353-70.

7.      Dyer BD, LeBlanc MD. Meeting report: Incorporating genomics research into undergraduate curricula. Cell Biol Educ 2002;1(4):101-4.

8.      Moffett SE, Menon AS, Meites EM, Kush S, Lin EY, Grappone T, et al. Preparing doctors for bedside computing. Lancet 2003;362(9377):86.

9.      Altman RB. The interactions between clinical informatics and bioinformatics: a case study. J Am Med Inform Assoc 2000;7(5):439-43.

10.     Kulikowski CA. The micro-macro spectrum of medical informatics challenges: from molecular medicine to transforming health care in a globalizing society. Methods Inf Med 2002;41(1):20-4.

11.     Kohane IS. Bioinformatics and clinical informatics: the imperative to collaborate [comment] [editorial]. J Am Med Inform Assoc 2000;7(5):512-6.



[1] http://www.amia.org/acmi/acmi.html


12/7/04: Since posting this note online, I have received several inquiries from students who despaired of ever being capable (or interested) in the breadth of expertise implied by the above quiz. Don't worry, most successful informaticians have less than half of the answers at their fingertips.