ERIC - Search Results

Showing all 7 results

Methods of Linking with Small Samples in a Common-Item Design: An Empirical Comparison. Research Report. ETS RR-09-38

Peer reviewed

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2009

A series of resampling studies was conducted to compare the accuracy of equating in a common item design using four different methods: chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating, and the circle-arc method. Four operational test forms, each containing more than 100 items, were used for…

Descriptors: Sampling, Sample Size, Accuracy, Test Items

Examining an Alternative to Score Equating: A Randomly Equivalent Forms Approach. Research Report. ETS RR-08-14

Peer reviewed

Liao, Chi-Wen; Livingston, Samuel A. – ETS Research Report Series, 2008

Randomly equivalent forms (REF) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…

Descriptors: Equated Scores, Item Analysis, Test Items, Difficulty Level

A Comparative Study of Standard-Setting Methods.

Peer reviewed

Livingston, Samuel A.; Zieky, Michael J. – Applied Measurement in Education, 1989

The borderline group standard-setting method (BGSM), Nedelsky method (NM), and Angoff method (AM) were compared, using reading scores for 1,948 and mathematics scores for 2,191 sixth through ninth graders. The NM and AM were inconsistent with the BGSM. Passing scores were higher where students were more able. (SLD)

Descriptors: Comparative Analysis, Cutting Scores, Elementary Secondary Education, Intermediate Grades

An Evaluation of the Kernel Equating Method: A Special Study with Pseudotests Constructed from Real Test Data. Research Report. ETS RR-06-02

Peer reviewed

von Davier, Alina A.; Holland, Paul W.; Livingston, Samuel A.; Casabianca, Jodi; Grant, Mary C.; Martin, Kathleen – ETS Research Report Series, 2006

This study examines how closely the kernel equating (KE) method (von Davier, Holland, & Thayer, 2004a) approximates the results of other observed-score equating methods--equipercentile and linear equatings. The study used pseudotests constructed of item responses from a real test to simulate three equating designs: an equivalent groups (EG)…

Descriptors: Equated Scores, Statistical Analysis, Simulation, Tests

A Comparative Study of Standard-Setting Methods.

Livingston, Samuel A.; Zieky, Michael J. – 1983

Four different systematic methods for selecting passing scores which differ primarily in the types of judgment they require were compared. The borderline group method and the contrasting groups method were each compared with the Nedelsky method at four schools and the Angoff method at another four schools. The Basic Skills Assessment Tests in…

Descriptors: Achievement Tests, Comparative Analysis, Cutting Scores, Elementary Secondary Education

What Combination of Sampling and Equating Methods Works Best? Revised.

Livingston, Samuel A.; And Others – 1989

Combinations of five methods of equating test forms and two methods of selecting samples of students for equating were compared for accuracy. The two sampling methods were representative sampling from the population and matching samples on the anchor test score. The equating methods were: (1) the Tucker method; (2) the Levine method; (3) the…

Descriptors: Comparative Analysis, Data Collection, Equated Scores, High School Students

Investigating Differences in Examinee Performance between Computer-Based and Handwritten Essays. Research Report. ETS RR-04-18

Peer reviewed

Yu, Lei; Livingston, Samuel A.; Larkin, Kevin C.; Bonett, John – ETS Research Report Series, 2004

This study compared essay scores from paper-based and computer-based versions of a writing test for prospective teachers. Scores for essays in the paper-based version averaged nearly half a standard deviation higher than those in the computer-based version, after applying a statistical control for demographic differences between the groups of…

Descriptors: Essays, Writing (Composition), Computer Assisted Testing, Technology Uses in Education