Deng, Nina – ProQuest LLC, 2011
Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, the LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…
Descriptors: Item Response Theory, Test Theory, Computation, Classification
Livingston, Samuel A. – 1984
Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…
Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models
Harris, Dickie A.; Penell, Roger J. – 1977
This study used a series of simulations to answer questions about the efficacy of adaptive testing raised by empirical studies. The first study showed that for reasonably high entry points, parameters estimated from paper-and-pencil test protocols cross-validated remarkably well to groups actually tested at a computer terminal. This suggested that…
Descriptors: Adaptive Testing, Computer Assisted Testing, Cost Effectiveness, Difficulty Level
Cumming, Alister; Kantor, Robert; Baba, Kyoko; Eouanzoui, Keanre; Erdosy, Usman; James, Mark – ETS Research Report Series, 2006
We assessed whether and how the discourse written for prototype integrated tasks (involving writing in response to print or audio source texts) field tested for the new TOEFL® differs from the discourse written for independent essays (i.e., the TOEFL essay). We selected 216 compositions written for 6 tasks by 36 examinees in a field…
Descriptors: Discourse Analysis, Essays, Scores, Language Proficiency
Slawski, Edward J.; Bauer, Ernest A. – 1978
A new method of analysis was used in the Michigan Educational Assessment Program to test minimum competencies in fourth grade reading achievement. This technique permitted a substantial decrease in testing time and costs. The original test consisted of 95 items measuring 19 objectives; mastery was indicated by correct responses to four out of the…
Descriptors: Educational Assessment, Educational Objectives, Educational Testing, Grade 4