Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 6 |
Descriptor
Source
Applied Measurement in… | 20 |
Author
Bassett, James | 1 |
Bateson, David J. | 1 |
Ben Backes | 1 |
Brian F. French | 1 |
Busch, John Christian | 1 |
Cohen, Dale J. | 1 |
Drasgow, Fritz | 1 |
Feldt, Leonard | 1 |
Forsyth, Robert A. | 1 |
Frary, Robert B. | 1 |
Geisinger, Kurt F. | 1 |
More ▼ |
Publication Type
Journal Articles | 20 |
Reports - Research | 14 |
Reports - Evaluative | 6 |
Information Analyses | 2 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 3 |
High Schools | 2 |
Elementary Education | 1 |
Grade 10 | 1 |
Grade 11 | 1 |
Grade 12 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 7 | 1 |
Intermediate Grades | 1 |
Middle Schools | 1 |
More ▼ |
Audience
Location
Canada | 1 |
Maryland | 1 |
Massachusetts | 1 |
North Carolina | 1 |
Ohio | 1 |
Texas | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
Iowa Tests of Educational… | 1 |
Massachusetts Comprehensive… | 1 |
National Assessment of… | 1 |
National Teacher Examinations | 1 |
Test Anxiety Inventory | 1 |
What Works Clearinghouse Rating
Ben Backes; James Cowan – Applied Measurement in Education, 2024
We investigate two research questions using a recent statewide transition from paper to computer-based testing: first, the extent to which test mode effects found in prior studies can be eliminated; and second, the degree to which online and paper assessments offer different information about underlying student ability. We first find very small…
Descriptors: Computer Assisted Testing, Test Format, Differences, Academic Achievement
Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Koran, Jennifer; Kopriva, Rebecca J. – Applied Measurement in Education, 2017
Providing appropriate test accommodations to most English language learners (ELLs) is important to facilitate meaningful inferences about learning. This study compared teacher large-scale test accommodation recommendations to those from a literature- and practitioner-grounded accommodation selection taxonomy. The taxonomy links student-specific…
Descriptors: English Language Learners, Testing Accommodations, Comparative Analysis, Taxonomy
Cohen, Dale J.; Zhang, Jin; Wothke, Werner – Applied Measurement in Education, 2019
Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting…
Descriptors: Test Items, Accessibility (for Disabled), Students with Disabilities, Low Achievement
Leighton, Jacqueline P. – Applied Measurement in Education, 2013
The Standards for Educational and Psychological Testing indicate that multiple sources of validity evidence should be used to support the interpretation of test scores. In the past decade, examinee response processes, as a source of validity evidence, have received increased attention. However, there have been relatively few methodological studies…
Descriptors: Psychological Testing, Standards, Interviews, Protocol Analysis
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
Miller, G. Edward; Yoes, Michael E.; Twing, Jon S. – Applied Measurement in Education, 2004
Two models are presented in this article for estimating the proportion of students who would pass all of three or more content area tests given that none have actually been tested in more than two of the content areas. The first model allows one to estimate the proportion of students who would pass all of three or more content area tests from the…
Descriptors: Scores, Standardized Tests, Student Evaluation, Testing Programs

Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992
Eighth grade teachers in three local school districts helped customize two standardized norm-referenced tests for ninth graders to investigate effects of deleting some items and adding locally constructed items. Results indicate that percentile ranks for the customized tests could be very different from those for the complete test. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Elementary Secondary Education, Grade 9

Drasgow, Fritz; And Others – Applied Measurement in Education, 1996
A general approach to the identification of individuals mismeasured by a standardized psychological test is reviewed. The method, originated by M. V. Levine and F. Drasgow (1988), has the advantage of statistical optimality. Use of optimal methods requires a psychometric model for normal responding and one for aberrant responding. (SLD)
Descriptors: Identification, Item Response Theory, Measurement Techniques, Models

Roznowski, Mary; Bassett, James – Applied Measurement in Education, 1992
Current coaching practices used in training test wiseness for analogy items on standardized test batteries were investigated in a 3-group design involving about 100 undergraduates in each condition. The largest improvement came in items in the middle range of difficulty, but overall effects of coaching were important. (SLD)
Descriptors: Difficulty Level, Higher Education, Standardized Tests, Teaching Methods

Frary, Robert B. – Applied Measurement in Education, 1993
Methods for detecting copying of multiple-choice test responses are reviewed and compared with respect to their effectiveness and the practicality of their application for groups of varying sizes. Reasons why effective detection methods are seldom applied in standardized and classroom testing are discussed. (SLD)
Descriptors: Cheating, Elementary Secondary Education, Evaluation Methods, Higher Education

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995
The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…
Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

Way, Walter D.; And Others – Applied Measurement in Education, 1989
The effects of using item response theory (IRT) ability estimates based on customized tests formed by selecting areas from a nationally standardized achievement test were examined. For some populations, in some conditions, IRT ability estimates can be equivalent to scores based on full-length tests. (SLD)
Descriptors: Achievement Tests, Adaptive Testing, Content Validity, Elementary Education

Geisinger, Kurt F. – Applied Measurement in Education, 1994
Federal law requires that individuals with handicapping conditions be administered assessments in ways that accommodate their disabilities without penalizing them. Validation studies are needed to evaluate the meaning of scores resulting from nonstandard test administrations. The limited number of these studies to date is reviewed. (SLD)
Descriptors: Disabilities, Educational Assessment, Elementary School Students, Elementary Secondary Education

Hambleton, Ronald K.; Murphy, Edward – Applied Measurement in Education, 1992
The validity of several criticisms of objective tests is addressed, and the viability of some alternatives to objective testing is discussed. Evidence against multiple-choice tests is not as strong as has been claimed. Authentic assessments may not always be better, and research about new forms of assessment is necessary. (SLD)
Descriptors: Achievement Tests, Educational Assessment, Literature Reviews, Measurement Techniques
Previous Page | Next Page ยป
Pages: 1 | 2