ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Scoring	5
Statistical Analysis	5
Test Length	5
Item Analysis	2
Scores	2
Simulation	2
Accuracy	1
Adaptive Testing	1
Classification	1
Comparative Analysis	1
Computation	1
Computer Assisted Testing	1
Computer Software	1
Cost Effectiveness	1
Difficulty Level	1
Discourse Analysis	1
Educational Assessment	1
Educational Objectives	1
Educational Testing	1
English (Second Language)	1
Equated Scores	1
Essay Tests	1
Essays	1
Estimation (Mathematics)	1
Factor Structure	1
More ▼

Source

ETS Research Report Series	1
ProQuest LLC	1

Author

Baba, Kyoko	1
Bauer, Ernest A.	1
Cumming, Alister	1
Deng, Nina	1
Eouanzoui, Keanre	1
Erdosy, Usman	1
Harris, Dickie A.	1
James, Mark	1
Kantor, Robert	1
Livingston, Samuel A.	1
Penell, Roger J.	1
Slawski, Edward J.	1
More ▼

Publication Type

Reports - Research	4
Dissertations/Theses -…	1
Journal Articles	1
Speeches/Meeting Papers	1

Education Level

Audience

Researchers

Location

Canada	1
Michigan	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Direct link

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification

Estimating the Reliability of Classifications Based on Composite Scores.

Download full text

Livingston, Samuel A. – 1984

Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…

Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models

Simulated and Empirical Studies of Flexilevel Testing in Air Force Technical Training Courses. Final Report for Period 1 May 1975-30 April 1977.

Harris, Dickie A.; Penell, Roger J. – 1977

This study used a series of simulations to answer questions about the efficacy of adaptive testing raised by empirical studies. The first study showed that for reasonable high entry points, parameters estimated from paper-and-pencil test protocols cross-validated remarkably well to groups actually tested at a computer terminal. This suggested that…

Descriptors: Adaptive Testing, Computer Assisted Testing, Cost Effectiveness, Difficulty Level

Analysis of Discourse Features and Verification of Scoring Levels for Independent and Integrated Prototype Written Tasks for the New TOEFL®. TOEFL® Monograph Series. MS-30. ETS RM-05-13

Peer reviewed
PDF on ERIC

Download full text

Cumming, Alister; Kantor, Robert; Baba, Kyoko; Eouanzoui, Keanre; Erdosy, Usman; James, Mark – ETS Research Report Series, 2006

We assessed whether and how the discourse written for prototype integrated tasks (involving writing in response to print or audio source texts) field tested for the new TOEFL® differs from the discourse written for independent essays (i.e., the TOEFL essay). We selected 216 compositions written for 6 tasks by 36 examinees in a field…

Descriptors: Discourse Analysis, Essays, Scores, Language Proficiency

Reducing Testing Time While Preserving Test Information: A Ten Item Fourth Grade MEAP Reading Test.

Download full text

Slawski, Edward J.; Bauer, Ernest A. – 1978

A new method of analysis was used in the Michigan Educational Assessment Program to test minimum competencies in fourth grade reading achievement. This technique permitted a substantial decrease in testing time and costs. The original test consisted of 95 items measuring 19 objectives; mastery was indicated by correct responses to four out of the…

Descriptors: Educational Assessment, Educational Objectives, Educational Testing, Grade 4