ERIC - Search Results

Showing 1 to 15 of 20 results

Are Online and Paper Tests Comparable? Evidence from Statewide K-12 Tests

Peer reviewed

Backes, Ben; Cowan, James – Applied Measurement in Education, 2024

We investigate two research questions using a recent statewide transition from paper to computer-based testing: first, the extent to which test mode effects found in prior studies can be eliminated; and second, the degree to which online and paper assessments offer different information about underlying student ability. We first find very small…

Descriptors: Computer Assisted Testing, Test Format, Differences, Academic Achievement

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Albano, Tony; French, Brian F.; Vo, Thao Thu – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Framing Appropriate Accommodations in Terms of Individual Need: Examining the Fit of Four Approaches to Selecting Test Accommodations of English Language Learners

Peer reviewed

Koran, Jennifer; Kopriva, Rebecca J. – Applied Measurement in Education, 2017

Providing appropriate test accommodations to most English language learners (ELLs) is important to facilitate meaningful inferences about learning. This study compared teacher large-scale test accommodation recommendations to those from a literature- and practitioner-grounded accommodation selection taxonomy. The taxonomy links student-specific…

Descriptors: English Language Learners, Testing Accommodations, Comparative Analysis, Taxonomy

Effects of Item Modifications on Test Accessibility for Persistently Low-Performing Students with Disabilities

Peer reviewed

Cohen, Dale J.; Zhang, Jin; Wothke, Werner – Applied Measurement in Education, 2019

Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting…

Descriptors: Test Items, Accessibility (for Disabled), Students with Disabilities, Low Achievement

Item Difficulty and Interviewer Knowledge Effects on the Accuracy and Consistency of Examinee Response Processes in Verbal Reports

Peer reviewed

Leighton, Jacqueline P. – Applied Measurement in Education, 2013

The Standards for Educational and Psychological Testing indicate that multiple sources of validity evidence should be used to support the interpretation of test scores. In the past decade, examinee response processes, as a source of validity evidence, have received increased attention. However, there have been relatively few methodological studies…

Descriptors: Psychological Testing, Standards, Interviews, Protocol Analysis

Stability of Rasch Scales over Time

Peer reviewed

Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010

Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…

Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis

Estimation of the All Tests Pass Rate When No Examinee Took All of the Tests

Peer reviewed

Miller, G. Edward; Yoes, Michael E.; Twing, Jon S. – Applied Measurement in Education, 2004

Two models are presented in this article for estimating the proportion of students who would pass all of three or more content area tests given that none have actually been tested in more than two of the content areas. The first model allows one to estimate the proportion of students who would pass all of three or more content area tests from the…

Descriptors: Scores, Standardized Tests, Student Evaluation, Testing Programs

Three Applications of Customized Testing in Local School Districts.

Peer reviewed

Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992

Eighth grade teachers in three local school districts helped customize two standardized norm-referenced tests for ninth graders to investigate effects of deleting some items and adding locally constructed items. Results indicate that percentile ranks for the customized tests could be very different from those for the complete test. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Elementary Secondary Education, Grade 9

Optimal Identification of Mismeasured Individuals.

Peer reviewed

Drasgow, Fritz; And Others – Applied Measurement in Education, 1996

A general approach to the identification of individuals mismeasured by a standardized psychological test is reviewed. The method, originated by M. V. Levine and F. Drasgow (1988), has the advantage of statistical optimality. Use of optimal methods requires a psychometric model for normal responding and one for aberrant responding. (SLD)

Descriptors: Identification, Item Response Theory, Measurement Techniques, Models

Training Test-Wiseness and Flawed Item Types.

Peer reviewed

Roznowski, Mary; Bassett, James – Applied Measurement in Education, 1992

Current coaching practices used in training test wiseness for analogy items on standardized test batteries were investigated in a 3-group design involving about 100 undergraduates in each condition. The largest improvement came in items in the middle range of difficulty, but overall effects of coaching were important. (SLD)

Descriptors: Difficulty Level, Higher Education, Standardized Tests, Teaching Methods

Statistical Detection of Multiple-Choice Answer Copying: Review and Commentary.

Peer reviewed

Frary, Robert B. – Applied Measurement in Education, 1993

Methods for detecting copying of multiple-choice test responses are reviewed and compared with respect to their effectiveness and the practicality of their application for groups of varying sizes. Reasons why effective detection methods are seldom applied in standardized and classroom testing are discussed. (SLD)

Descriptors: Cheating, Elementary Secondary Education, Evaluation Methods, Higher Education

Linking Statewide Tests to the National Assessment of Educational Progress: Stability of Results.

Peer reviewed

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995

The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…

Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

IRT Ability Estimates from Customized Achievement Tests without Representative Content Sampling.

Peer reviewed

Way, Walter D.; And Others – Applied Measurement in Education, 1989

The effects of using item response theory (IRT) ability estimates based on customized tests formed by selecting areas from a nationally standardized achievement test were examined. For some populations, in some conditions, IRT ability estimates can be equivalent to scores based on full-length tests. (SLD)

Descriptors: Achievement Tests, Adaptive Testing, Content Validity, Elementary Education

Psychometric Issues in Testing Students with Disabilities.

Peer reviewed

Geisinger, Kurt F. – Applied Measurement in Education, 1994

Federal law requires that individuals with handicapping conditions be administered assessments in ways that accommodate their disabilities without penalizing them. Validation studies are needed to evaluate the meaning of scores resulting from nonstandard test administrations. The limited number of these studies to date is reviewed. (SLD)

Descriptors: Disabilities, Educational Assessment, Elementary School Students, Elementary Secondary Education

A Psychometric Perspective on Authentic Measurement.

Peer reviewed

Hambleton, Ronald K.; Murphy, Edward – Applied Measurement in Education, 1992

The validity of several criticisms of objective tests is addressed, and the viability of some alternatives to objective testing is discussed. Evidence against multiple-choice tests is not as strong as has been claimed. Authentic assessments may not always be better, and research about new forms of assessment is necessary. (SLD)

Descriptors: Achievement Tests, Educational Assessment, Literature Reviews, Measurement Techniques
