ERIC - Search Results

Showing all 10 results

International Comparisons and Sensitivity to Instruction

Peer reviewed

Wiliam, Dylan – Assessment in Education: Principles, Policy & Practice, 2008

While international comparisons such as those provided by PISA may be meaningful in terms of overall judgements about the performance of educational systems, caution is needed in terms of more fine-grained judgements. In particular it is argued that using the results of PISA to draw conclusions about the quality of instruction in different systems is…

Descriptors: Test Bias, Test Construction, Comparative Testing, Evaluation

Comparisons among Designs for Equating Mixed-Format Tests in Large-Scale Assessments

Peer reviewed

Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010

In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…

Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias

Differentials of a State Reading Assessment: Item Functioning, Distractor Functioning, and Omission Frequency for Disability Categories

Peer reviewed

Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009

Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…

Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior

Comparability of GCSE Examinations in Different Subjects: An Application of the Rasch Model

Peer reviewed

Coe, Robert – Oxford Review of Education, 2008

The comparability of examinations in different subjects has been a controversial topic for many years and a number of criticisms have been made of statistical approaches to estimating the "difficulties" of achieving particular grades in different subjects. This paper argues that if comparability is understood in terms of a linking…

Descriptors: Test Items, Grades (Scholastic), Foreign Countries, Test Bias

School-by-School Test Score Comparisons: Statistical Issues and Pitfalls.

Peer reviewed

Rafferty, Eileen A.; Treff, August V. – ERS Spectrum, 1994

Addresses issues faced by institutions attempting to design school profiles to meet accountability standards. Reports of high-stakes test results can be skewed by choice of statistic type (percent of students passing versus mean scores), sample bias, geographical transients, and omission errors. Administrators must look beyond "common…

Descriptors: Accountability, Achievement Tests, Comparative Testing, Elementary Secondary Education

Comparing DIF across Math and Reading/Language Arts Tests for Students Receiving a Read-Aloud Accommodation

Peer reviewed

Bolt, Sara E.; Ysseldyke, James E. – Applied Measurement in Education, 2006

Although testing accommodations are commonly provided to students with disabilities within large-scale testing programs, research findings on how well accommodations allow for comparable measurement of student knowledge and skill remain inconclusive. The purpose of this study was to examine the extent to which 1 commonly held belief about testing…

Descriptors: Oral Reading, Testing Accommodations, Disabilities, Special Needs Students

Exact Small-Sample Differential Item Functioning Methods for Polytomous Items with Illustration Based on an Attitude Survey

Peer reviewed

Meyer, J. Patrick; Huynh, Huynh; Seaman, Michael A. – Journal of Educational Measurement, 2004

Exact nonparametric procedures have been used to identify the level of differential item functioning (DIF) in binary items. This study explored the use of exact DIF procedures with items scored on a Likert scale. The results from an attitude survey suggest that the large-sample Cochran-Mantel-Haenszel (CMH) procedure identifies more items as…

Descriptors: Test Bias, Attitude Measures, Surveys, Predictive Validity

A Comparison of Unidimensional and Multidimensional IRT Approaches to Test Information in a Test Battery.

Chang, Yu-Wen; Davison, Mark L. – 1992

Standard errors and bias of unidimensional and multidimensional ability estimates were compared in a factorial, simulation design with two item response theory (IRT) approaches, two levels of test correlation (0.42 and 0.63), two sample sizes (500 and 1,000), and a hierarchical test content structure. Bias and standard errors of subtest scores…

Descriptors: Comparative Testing, Computer Simulation, Correlation, Error of Measurement

A Critical Analysis of Interview, Telephone, and Mail Survey Designs.

Katz, Elinor – 1993

A critical analysis is presented of the literature as it relates to survey research, including personal interviews, telephone interviews, and mail questionnaires. Additional research concerns are explored, and a code of ethics for survey researchers is presented. Focus groups, interviews, long interviews, telephone interviews, and mail surveys are…

Descriptors: Codes of Ethics, Comparative Testing, Confidentiality, Interviews

The Revised SAT's and the ACT's--Are They Really Different?

McManus, Barbara Luger – 1992

This paper discusses whether or not revisions of the Scholastic Aptitude Test (SAT) and the American College Test (ACT) have created such significant differences between the two tests that a student could conceivably score significantly higher on one than the other. The SAT has been revised to meet the needs of an increasingly diverse student…

Descriptors: Ability, Achievement Tests, Aptitude Tests, College Entrance Examinations
