NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)0
Since 2006 (last 20 years)8
Audience
Researchers1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 24 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Descriptors: Educational Testing, Scores, Reports, Psychometrics
Kim, Jihye – ProQuest LLC, 2010
In DIF studies, a Type I error refers to the mistake of identifying non-DIF items as DIF items, and a Type I error rate refers to the proportion of Type I errors in a simulation study. The possibility of making a Type I error in DIF studies is always present and high possibility of making such an error can weaken the validity of the assessment.…
Descriptors: Test Bias, Test Length, Simulation, Testing
Horst, S. Jeanne – ProQuest LLC, 2010
Despite high-stakes applications of assessment findings, assessment data are frequently collected in situations that are of low-stakes to examinees. Because low-stakes tests are of little consequence to the examinees, test-taking motivation and thus the validity of inferences drawn from unmotivated examinees' scores are of concern. The current…
Descriptors: Personality Traits, Motivation, Personality, Data Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Tormakangas, Kari – Educational Research and Evaluation, 2011
Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…
Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Regional Educational Laboratory Southeast, 2009
Since the passage of the No Child Left Behind Act of 2001 (2002), there has been increased interest in using student achievement data (through standardized tests) to evaluate teacher effectiveness. Two U.S. Department of Education secretaries, Secretary Spellings and Secretary Duncan, have expressed interest in growth models and the need to…
Descriptors: Evidence, Educational Research, Teacher Effectiveness, Teacher Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Baker, Eva L. – Educational Assessment, 2007
This article describes the history, evidence warrants, and evolution of the Center for Research on Evaluation, Standards, and Student Testing's (CRESST) model-based assessments. It considers alternative interpretations of scientific or practical models and illustrates how model-based assessment addresses both definitions. The components of the…
Descriptors: Educational Testing, Computer Assisted Testing, Validity, Test Construction
Tripp, John D.; Todd, Anne H. – 1982
A project was conducted to develop a model for evaluating placement testing in the North Carolina System of Community Colleges. Researchers at Central Piedmont Community College conducted a longitudinal study of students' progress through the college curriculum as it related to placement test scores. The following numbers of students comprised the…
Descriptors: Academic Achievement, Community Colleges, Educational Testing, Equivalency Tests
Bloomer, Corinne – Teacher, 1975
Article discussed the disadvantages of student testing as a means of evaluating student progress in the classroom and suggested the use of a new model of assessment. Three steps intended for classroom diagnosis of students were described. (RK)
Descriptors: Academic Achievement, Educational Testing, Models, Standardized Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Green, Sylvia; Oates, Tim – Educational Research, 2009
Background: In this article we address some of the challenges posed by the development of national assessment systems and discuss the need for high quality information on trends in attainment; support for school improvement processes and ways in which learning should be enhanced through valid assessment. Purpose: Key elements are explored,…
Descriptors: Educational Objectives, National Standards, Educational Quality, Educational Change
Peer reviewed Peer reviewed
Wardrop, James L.; And Others – Journal of Educational Measurement, 1982
A structure for describing different approaches to testing is generated by identifying five dimensions along which tests differ: test uses, item generation, item revision, assessment of precision, and validation. These dimensions are used to profile tests of reading comprehension. Only norm-referenced achievement tests had an inference system…
Descriptors: Achievement Tests, Comparative Analysis, Educational Testing, Models
Peer reviewed Peer reviewed
Berk, Ronald A. – Journal of Experimental Education, 1976
Attempts to select empirically the optimal cutting score or criterion level for a test based on response data from validation samples of instructed and uninstructed students. This score maximizes the probability of correct mastery-nonmastery decisions (or minimizes the probability of incorrect decisions). (Author/RK)
Descriptors: Charts, Criterion Referenced Tests, Cutting Scores, Educational Testing
Peer reviewed Peer reviewed
Wang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001
Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria
Mislevy, Robert J.; Almond, Russell G. – 1997
This paper synthesizes ideas from the fields of graphical modeling and education testing, particularly item response theory (IRT) applied to computerized adaptive testing (CAT). Graphical modeling can offer IRT a language for describing multifaceted skills and knowledge, and disentangling evidence from complex performances. IRT-CAT can offer…
Descriptors: Adaptive Testing, Computer Assisted Testing, Educational Testing, Higher Education
Office of Personnel Management, Washington, DC. – 1979
The stimulus for this colloquium was the convergence of several significant developments bearing on the construct validation of standardized tests and other assessment methods. Of these developments, some were fundamental to psychology as a science; others reflected socio-political pressures on measurement in education and employment. The ten…
Descriptors: Aptitude Tests, Educational Practices, Educational Testing, Employment Practices
Previous Page | Next Page ยป
Pages: 1  |  2