NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)0
Since 2006 (last 20 years)6
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Showing all 11 results Save | Export
Tian, Feng – ProQuest LLC, 2011
There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…
Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009
This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…
Descriptors: Probability, Simulation, Models, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Twissell, Adrian – Design and Technology Education, 2011
This study examines whether MidYIS and YELLIS cognitive ability tests (CATs) are appropriate methods for the identification of giftedness in Design and Technology. A key rationale for the study was whether CATs and able to identify those students with the aptitudes considered of importance to identifying giftedness in Design and Technology and…
Descriptors: Foreign Countries, Gifted, Identification, Cognitive Ability
Educational Testing Service, 2010
This document describes the breadth of the research that the ETS (Educational Testing Service) Research & Development division is conducting in 2010. This portfolio will be updated in early 2011 to reflect changes to existing projects and new projects that were added after this document was completed. The research described in this portfolio falls…
Descriptors: Portfolios (Background Materials), Testing Programs, Educational Testing, Private Agencies
Peer reviewed Peer reviewed
Brennan, Robert L. – Educational Measurement: Issues and Practice, 1997
The history of generalizability theory (G theory) is told from the perspective of one researcher's experiences, describing psychometric and scientific perspectives that influenced the development of G theory and its adoption. Work that remains to be done in the field is outlined. (SLD)
Descriptors: Educational Testing, Generalizability Theory, Measurement, Psychometrics
Educational Testing Service, 2008
This document describes the breadth of the research being conducted in 2008 by the Research and Development Division at Educational Testing Service (ETS). The research described falls into three large categories: (1) Research supported by the ETS research allocation; (2) Research funded by testing programs at ETS; and (3) Research funded by…
Descriptors: Research and Development, Testing Programs, Educational Testing, Educational Research
Peer reviewed Peer reviewed
Wang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001
Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria
Wilcox, Rand R. – 1982
This document contains three papers from the Methodology Project of the Center for the Study of Evaluation. Methods for characterizing test accuracy are reported in the first two papers. "Bounds on the K Out of N Reliability of a Test, and an Exact Test for Hierarchically Related Items" describes and illustrates how an extension of a…
Descriptors: Educational Testing, Evaluation Methods, Guessing (Tests), Latent Trait Theory
Page, Ellis B.; Paulus, Dieter H. – 1968
This study aimed at expanding a new field of educational measurement, by investigating the feasibility of using computer programs for the automatic analysis and evaluation of student writing. Essays written by secondary students in their English classes were rated by multiple independent judges on a number of traits usually considered important:…
Descriptors: Computer Assisted Instruction, Content Analysis, Educational Diagnosis, Educational Technology
Doherty, Margaret, Comp.; MacLatchy, Josephine, Comp. – Bureau of Education, Department of the Interior, 1924
The bibliography presented in this bulletin purports to cover the printed material issued in this country concerning intelligence and educational tests during the period from January 1, 1918 to June 30, 1922. It has been the purpose of the compilers to make the bibliography as useful to students and to practical school people as possible. To that…
Descriptors: Printed Materials, Psychological Testing, Educational Testing, Intelligence Tests