
The Relationship of Expert-System Scored Constrained Free-Response Items to Multiple-Choice and Open-Ended Items

dc.contributor.authorBennett, Randyen_US
dc.contributor.authorRock, Donalden_US
dc.contributor.authorBraun, Henryen_US
dc.contributor.authorFrye, Douglasen_US
dc.contributor.authorSpohrer, Jamesen_US
dc.contributor.authorSoloway, Ellioten_US
dc.date.accessioned2010-04-13T20:16:19Z
dc.date.available2010-04-13T20:16:19Z
dc.date.issued1990en_US
dc.identifier.citationBennett, Randy; Rock, Donald; Braun, Henry; Frye, Douglas; Spohrer, James; Soloway, Elliot (1990). "The Relationship of Expert-System Scored Constrained Free-Response Items to Multiple-Choice and Open-Ended Items." Applied Psychological Measurement 14(2): 151-162. <http://hdl.handle.net/2027.42/68260>en_US
dc.identifier.issn0146-6216en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/68260
dc.description.abstractThis study examined the relationship of an expert-system scored constrained free-response item (requiring the student to debug a faulty computer program) to two other item types: (1) multiple-choice and (2) free-response (requiring production of a program). Confirmatory factor analysis was used to test the fit of a three-factor model to these data and to compare the fit of the model to three alternatives. These models were fit using two random-half samples, one given a faulty program containing one bug and the other a program with three bugs. A single-factor model best fit the data for the sample taking the one-bug constrained free response, and a two-factor model fit the data somewhat better for the second sample. In addition, the factor intercorrelations showed this item type to be highly related to both the free-response and multiple-choice measures.en_US
dc.format.extent3108 bytes
dc.format.extent869221 bytes
dc.format.mimetypetext/plain
dc.format.mimetypeapplication/pdf
dc.publisherSage Publicationsen_US
dc.subject.otherArtificial Intelligenceen_US
dc.subject.otherConstructed-response Itemsen_US
dc.subject.otherExpert-system Scoringen_US
dc.subject.otherFree-Response Itemsen_US
dc.subject.otherOpen-ended Itemsen_US
dc.titleThe Relationship of Expert-System Scored Constrained Free-Response Items to Multiple-Choice and Open-Ended Itemsen_US
dc.typeArticleen_US
dc.subject.hlbsecondlevelEducationen_US
dc.subject.hlbsecondlevelPsychologyen_US
dc.subject.hlbtoplevelSocial Sciencesen_US
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumUniversity of Michiganen_US
dc.contributor.affiliationotherEducational Testing Serviceen_US
dc.contributor.affiliationotherEducational Testing Serviceen_US
dc.contributor.affiliationotherEducational Testing Serviceen_US
dc.contributor.affiliationotherYale Universityen_US
dc.contributor.affiliationotherYale Universityen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/68260/2/10.1177_014662169001400204.pdf
dc.identifier.doi10.1177/014662169001400204en_US
dc.identifier.sourceApplied Psychological Measurementen_US
dc.identifier.citedreferenceAckerman, T.A., & Smith, P.L. (1988). A comparison of the information provided by essay, multiple-choice, and free-response writing tests. Applied Psychological Measurement, 12, 117-128.en_US
dc.identifier.citedreferenceBennett, R.E. (in press). Toward intelligent assessment: An integration of constructed-response testing, artificial intelligence, and model-based measurement. In N. Frederiksen, R. J. Mislevy, & I. Bejar (Eds.), Test theory for a new generation of tests. Hillsdale NJ: Erlbaum.en_US
dc.identifier.citedreferenceBirenbaum, M., & Tatsuoka, K.K. (1987). Open-ended versus multiple-choice response formats—It does make a difference for diagnostic purposes. Applied Psychological Measurement, 11, 385-395.en_US
dc.identifier.citedreferenceBleistein, C., Maneckshana, B., & McLean, D. (1988). Test analysis: College Board Advanced Placement Examination Computer Science 3JBP (SR-88-63). Princeton NJ: Educational Testing Service.en_US
dc.identifier.citedreferenceBraun, H.I. (1988). Understanding scoring reliability: Experiments in calibrating essay readers. Journal of Educational Statistics, 13, 1-18.en_US
dc.identifier.citedreferenceBraun, H.I., Bennett, R.E., Frye, D., & Soloway, E. (in press). Scoring constructed-responses using expert systems. Journal of Educational Measurement.en_US
dc.identifier.citedreferenceCollege Board. (1988). Advanced Placement course description : Computer science. New York: Author.en_US
dc.identifier.citedreferenceJohnson, W.L., & Soloway, E. (1985). PROUST: An automatic debugger for Pascal programs. Byte, 10(4), 179-190.en_US
dc.identifier.citedreferenceJöreskog, K., & Sörbom, D. (1986). PRELIS: A program for multivariate data screening and data summarization. Mooresville IN: Scientific Software, Inc.en_US
dc.identifier.citedreferenceJöreskog, K., & Sörbom, D. (1988). LISREL 7: A guide to the program and applications. Chicago: SPSS, Inc.en_US
dc.identifier.citedreferenceLoehlin, J.C. (1987). Latent variable models. Hillsdale NJ: Erlbaum.en_US
dc.identifier.citedreferenceLord, F.M. (1980). Applications of item response theory to practical testing problems. Hillsdale NJ: Erlbaum.en_US
dc.identifier.citedreferenceMarsh, H.W., & Hocevar, D. (1985). Application of confirmatory factor analysis to the study of self-concept: First and higher order factor models and their invariance across groups. Psychological Bulletin, 97, 562-582.en_US
dc.identifier.citedreferenceMazzeo, J., & Bleistein, C. (1986). Test analysis: College Board Advanced Placement Examination Computer Science 3IBP (SR-86-105). Princeton NJ: Educational Testing Service.en_US
dc.identifier.citedreferenceMazzeo, J., & Flesher, R. (1985). Test analysis: College Board Advanced Placement Examination Computer Science 3HBP (SR-85-180). Princeton NJ: Educational Testing Service.en_US
dc.identifier.citedreferenceSobel, M.E., & Bohrnstedt, G.W. (1985). Use of null models in evaluating the fit of covariance structure models. In N. B. Tuma (Ed.), Sociological methodology (pp. 152-178). San Francisco CA: Jossey-Bass.en_US
dc.identifier.citedreferenceSpohrer, J.C. (1989). MARCEL: A generate-test-and-debug (GTD) impasse/repair model of student programmers (CSD/RR No. 687). New Haven CT: Yale University, Department of Computer Science.en_US
dc.identifier.citedreferenceSternberg, R.J. (1980). Factor theories of intelligence are all right almost. Educational Researcher, 9, 6-18.en_US
dc.identifier.citedreferenceTraub, R.E., & Fisher, C.W. (1977). On the equivalence of constructed-response and multiple-choice tests. Applied Psychological Measurement, 1, 355-369.en_US
dc.identifier.citedreferenceTucker, L.R., & Lewis, C. (1973). A reliability coefficient for maximum likelihood factor analysis. Psychometrika, 38, 1-10.en_US
dc.identifier.citedreferenceWard, W.C. (1982). A comparison of free-response and multiple-choice forms of verbal aptitude tests. Applied Psychological Measurement, 6, 1-11.en_US
dc.identifier.citedreferenceWard, W.C., Frederiksen, N., & Carlson, S.B. (1980). Construct validity of free-response and machine-scorable forms of a test. Journal of Educational Measurement, 17, 11-29.en_US
dc.owningcollnameInterdisciplinary and Peer-Reviewed

