Showing all 10 results
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments generated from a large test-item database maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity, it is important that all instances of an assessment that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
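The abstract does not spell out the algorithm, so the following is only a minimal sketch of one plausible approach: draw each parallel form by sampling within content strata, and accept the draw only if its mean difficulty lands in a tolerance band around a target. All names and numbers here are hypothetical, not the paper's actual method.

import random

# Hypothetical item bank: (item_id, content_area, difficulty in [0, 1])
bank = [(i, random.choice(["algebra", "geometry"]), random.uniform(0.2, 0.8))
        for i in range(200)]

def draw_form(bank, blueprint, target_mean, tol=0.02, max_tries=1000):
    """Sample blueprint[area] items per content area; accept the form
    only if its mean difficulty is within tol of target_mean (an
    assumed acceptance rule)."""
    for _ in range(max_tries):
        form = []
        for area, n in blueprint.items():
            pool = [item for item in bank if item[1] == area]
            form.extend(random.sample(pool, n))
        if abs(sum(it[2] for it in form) / len(form) - target_mean) <= tol:
            return form
    raise RuntimeError("no acceptable form found within max_tries")

form = draw_form(bank, {"algebra": 10, "geometry": 10}, target_mean=0.5)
print(len(form), sum(it[2] for it in form) / len(form))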
Peer reviewed
Direct link
Federer, Meghan Rector; Nehm, Ross H.; Pearl, Dennis K. – CBE - Life Sciences Education, 2016
Understanding sources of performance bias in science assessment provides important insights into whether science curricula and/or assessments are valid representations of student abilities. Research investigating assessment bias due to factors such as instrument structure, participant characteristics, and item types is well documented across a…
Descriptors: Gender Differences, Biology, Science Instruction, Case Studies
Peer reviewed
PDF on ERIC: Download full text
Colwell, Nicole Makas – Journal of Education and Training Studies, 2013
This paper highlights current findings and issues regarding the role of computer-adaptive testing in test anxiety. The computer-adaptive test (CAT) proposed by one of the Common Core consortia brings these issues to the forefront. Research has long indicated that test anxiety impairs student performance. More recent research indicates that…
Descriptors: Test Anxiety, Computer Assisted Testing, Evaluation Methods, Standardized Tests
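For readers unfamiliar with the mechanism under discussion, a bare-bones adaptive loop looks roughly like this; operational CATs use IRT-based scoring and exposure controls, so this sketch is illustrative only.

def run_cat(items, answer, n_items=5):
    """items: list of item difficulties; answer(difficulty) -> True/False.
    Pick the unused item closest to the current ability estimate, then
    move the estimate up after a correct response and down after an error."""
    theta, step = 0.0, 1.0
    used = set()
    for _ in range(n_items):
        i = min((j for j in range(len(items)) if j not in used),
                key=lambda j: abs(items[j] - theta))
        used.add(i)
        theta += step if answer(items[i]) else -step
        step /= 2  # shrink steps so the estimate settles
    return theta

# Simulated examinee of true ability 0.7 (hypothetical data)
est = run_cat([-2, -1, 0, 1, 2, 0.5, 1.5], lambda b: 0.7 > b)
print(round(est, 2))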
Peer reviewed
Direct link
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
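A common screen for DIF of the kind studied here is the Mantel-Haenszel common odds ratio, computed across total-score strata. The sketch below is illustrative and is not the authors' analysis; the toy data are made up.

def mantel_haenszel_dif(data):
    """data: list of (total_score, group, correct) with group in
    {'ref', 'focal'} and correct in {0, 1}. Returns the common odds
    ratio; values far from 1 suggest DIF on the item."""
    strata = {}
    for score, group, correct in data:
        cell = strata.setdefault(score, {"ref": [0, 0], "focal": [0, 0]})
        cell[group][correct] += 1  # index 0 = wrong, 1 = right
    num = den = 0.0
    for cell in strata.values():
        n = sum(cell["ref"]) + sum(cell["focal"])
        if n == 0:
            continue
        num += cell["ref"][1] * cell["focal"][0] / n
        den += cell["ref"][0] * cell["focal"][1] / n
    return num / den if den else float("inf")

toy = [(1, "ref", 1), (1, "focal", 0), (1, "ref", 0), (1, "focal", 1),
       (2, "ref", 1), (2, "focal", 0), (2, "ref", 1), (2, "focal", 1)]
print(mantel_haenszel_dif(toy))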
Stone, Elizabeth; Davey, Tim – Educational Testing Service, 2011
There has been increased interest in developing computer-adaptive testing (CAT) and multistage assessments for K-12 accountability assessments. The move to adaptive testing has been met with some resistance from those in the field of special education who express concern about the routing of students with divergent profiles (e.g., some students with…
Descriptors: Disabilities, Adaptive Testing, Accountability, Computer Assisted Testing
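The routing at issue can be pictured with a two-stage sketch in which a stage-1 number-correct score sends the student to an easier, on-level, or harder stage-2 module. The cutoffs below are assumptions for illustration, not any operational design.

def route(stage1_correct, n_stage1=10):
    """Route to a stage-2 module based on the stage-1 number-correct
    score (real multistage designs use IRT scoring, not raw cutoffs)."""
    frac = stage1_correct / n_stage1
    if frac < 0.4:
        return "easy module"
    if frac < 0.7:
        return "on-level module"
    return "hard module"

for score in (3, 6, 9):
    print(score, "->", route(score))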
Peer reviewed
Direct link
Camilli, Gregory; Prowker, Adam; Dossey, John A.; Lindquist, Mary M.; Chiu, Ting-Wei; Vargas, Sadako; de la Torre, Jimmy – Journal of Educational Measurement, 2008
A new method for analyzing differential item functioning is proposed to investigate the relative strengths and weaknesses of multiple groups of examinees. Accordingly, the notion of a conditional measure of difference between two groups (Reference and Focal) is generalized to a conditional variance. The objective of this article is to present and…
Descriptors: Test Bias, National Competency Tests, Grade 4, Difficulty Level
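The abstract's central idea, generalizing a two-group conditional difference to a conditional variance over many groups, can be sketched as follows: at each total score, take the variance of the groups' proportions correct on the item, then average the strata weighted by their sizes. This is one reading of the abstract, not the authors' exact estimator.

def conditional_variance(data):
    """data: list of (total_score, group, correct) with correct in {0, 1}."""
    strata = {}
    for score, group, correct in data:
        strata.setdefault(score, {}).setdefault(group, []).append(correct)
    total_n, acc = 0, 0.0
    for cell in strata.values():
        props = [sum(v) / len(v) for v in cell.values()]  # per-group p-values
        mean = sum(props) / len(props)
        var = sum((p - mean) ** 2 for p in props) / len(props)
        n = sum(len(v) for v in cell.values())
        acc += n * var  # weight each stratum by its size
        total_n += n
    return acc / total_n

toy = [(1, "A", 1), (1, "B", 0), (1, "C", 1),
       (2, "A", 1), (2, "B", 1), (2, "C", 0)]
print(round(conditional_variance(toy), 3))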
Peer reviewed
PDF on ERIC: Download full text
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that anchor tests should be miniature versions (i.e., minitests) of the tests being equated with respect to content and statistical characteristics. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…
Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level
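For context on what an anchor test does in equating, here is a textbook-style chained linear link, not the report's analysis: form X scores are mapped onto the anchor scale and then onto form Y. All data are made up.

from statistics import mean, stdev

def linear_link(x_scores, y_scores):
    """Return f with f(x) matched to the mean and SD of y_scores."""
    mx, sx = mean(x_scores), stdev(x_scores)
    my, sy = mean(y_scores), stdev(y_scores)
    return lambda x: my + sy * (x - mx) / sx

# Hypothetical data: each group takes its own form plus the common anchor
x_form, x_anchor = [10, 12, 15, 18, 20], [5, 6, 7, 9, 10]
y_form, y_anchor = [22, 25, 27, 30, 33], [4, 6, 7, 8, 11]

x_to_anchor = linear_link(x_form, x_anchor)
anchor_to_y = linear_link(y_anchor, y_form)
equated = lambda x: anchor_to_y(x_to_anchor(x))  # chained X -> anchor -> Y
print(round(equated(15), 2))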
Peer reviewed
Stricker, Lawrence J. – Educational and Psychological Measurement, 1984
The stability of a partial correlation index, comparisons of item characteristic curves, and comparisons of item difficulties was evaluated in assessing race and sex differences in performance on verbal items of the Graduate Record Examination Aptitude Test. All three indexes exhibited consistency in identifying the same items in different…
Descriptors: College Entrance Examinations, Comparative Analysis, Correlation, Difficulty Level
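One of the three indexes, comparison of item difficulties, is conventionally done on ETS's delta scale. The transformation itself is standard (13 plus 4 times the inverse-normal of the proportion wrong); the per-group data below are hypothetical.

from statistics import NormalDist

def delta(p_correct):
    """ETS delta: harder items (lower p_correct) get larger deltas."""
    return 13 + 4 * NormalDist().inv_cdf(1 - p_correct)

# Hypothetical per-group proportions correct on three items
group1 = [0.80, 0.60, 0.35]
group2 = [0.75, 0.50, 0.30]
for p1, p2 in zip(group1, group2):
    print(round(delta(p1), 2), round(delta(p2), 2))

In a delta plot, items falling well off the trend line of the two groups' deltas are flagged as potentially biased.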
Merz, William R.; Grossen, Neal E. – 1978
Six approaches to assessing test item bias were examined: transformed item difficulty, point-biserial correlations, chi-square, factor analysis, the one-parameter item characteristic curve, and the three-parameter item characteristic curve. Data sets for analysis were generated by a Monte Carlo technique based on the three-parameter model; thus, four…
Descriptors: Difficulty Level, Evaluation Methods, Factor Analysis, Item Analysis
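The Monte Carlo step mentioned here amounts to drawing 0/1 responses from a three-parameter logistic model. The formula is the standard 3PL; the item parameters below are hypothetical.

import math, random

def p_correct(theta, a, b, c):
    """Three-parameter logistic model: guessing floor c, then a logistic
    rise with discrimination a around difficulty b."""
    return c + (1 - c) / (1 + math.exp(-1.7 * a * (theta - b)))

def simulate(theta, items):
    """One examinee's 0/1 responses to a list of (a, b, c) items."""
    return [int(random.random() < p_correct(theta, *it)) for it in items]

random.seed(0)
items = [(1.0, -0.5, 0.2), (1.2, 0.0, 0.25), (0.8, 1.0, 0.2)]
print(simulate(theta=0.5, items=items))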
Merz, William R.; Rudner, Lawrence M. – 1978
Terms related to test bias and test fairness have been used in a variety of ways, but in this document the "fair use of tests" is defined as equitable selection procedures by means of intact tests, and "test item bias" refers to the study of separate items with respect to the tests of which they are a part. Seven…
Descriptors: Analysis of Covariance, Analysis of Variance, Difficulty Level, Evaluation Criteria