NotesFAQContact Us
Collection
Advanced
Search Tips
Source
Applied Measurement in…20
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Ben Backes; James Cowan – Applied Measurement in Education, 2024
We investigate two research questions using a recent statewide transition from paper to computer-based testing: first, the extent to which test mode effects found in prior studies can be eliminated; and second, the degree to which online and paper assessments offer different information about underlying student ability. We first find very small…
Descriptors: Computer Assisted Testing, Test Format, Differences, Academic Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Koran, Jennifer; Kopriva, Rebecca J. – Applied Measurement in Education, 2017
Providing appropriate test accommodations to most English language learners (ELLs) is important to facilitate meaningful inferences about learning. This study compared teacher large-scale test accommodation recommendations to those from a literature- and practitioner-grounded accommodation selection taxonomy. The taxonomy links student-specific…
Descriptors: English Language Learners, Testing Accommodations, Comparative Analysis, Taxonomy
Peer reviewed Peer reviewed
Direct linkDirect link
Cohen, Dale J.; Zhang, Jin; Wothke, Werner – Applied Measurement in Education, 2019
Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting…
Descriptors: Test Items, Accessibility (for Disabled), Students with Disabilities, Low Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Leighton, Jacqueline P. – Applied Measurement in Education, 2013
The Standards for Educational and Psychological Testing indicate that multiple sources of validity evidence should be used to support the interpretation of test scores. In the past decade, examinee response processes, as a source of validity evidence, have received increased attention. However, there have been relatively few methodological studies…
Descriptors: Psychological Testing, Standards, Interviews, Protocol Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Miller, G. Edward; Yoes, Michael E.; Twing, Jon S. – Applied Measurement in Education, 2004
Two models are presented in this article for estimating the proportion of students who would pass all of three or more content area tests given that none have actually been tested in more than two of the content areas. The first model allows one to estimate the proportion of students who would pass all of three or more content area tests from the…
Descriptors: Scores, Standardized Tests, Student Evaluation, Testing Programs
Peer reviewed Peer reviewed
Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992
Eighth grade teachers in three local school districts helped customize two standardized norm-referenced tests for ninth graders to investigate effects of deleting some items and adding locally constructed items. Results indicate that percentile ranks for the customized tests could be very different from those for the complete test. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Elementary Secondary Education, Grade 9
Peer reviewed Peer reviewed
Drasgow, Fritz; And Others – Applied Measurement in Education, 1996
A general approach to the identification of individuals mismeasured by a standardized psychological test is reviewed. The method, originated by M. V. Levine and F. Drasgow (1988), has the advantage of statistical optimality. Use of optimal methods requires a psychometric model for normal responding and one for aberrant responding. (SLD)
Descriptors: Identification, Item Response Theory, Measurement Techniques, Models
Peer reviewed Peer reviewed
Roznowski, Mary; Bassett, James – Applied Measurement in Education, 1992
Current coaching practices used in training test wiseness for analogy items on standardized test batteries were investigated in a 3-group design involving about 100 undergraduates in each condition. The largest improvement came in items in the middle range of difficulty, but overall effects of coaching were important. (SLD)
Descriptors: Difficulty Level, Higher Education, Standardized Tests, Teaching Methods
Peer reviewed Peer reviewed
Frary, Robert B. – Applied Measurement in Education, 1993
Methods for detecting copying of multiple-choice test responses are reviewed and compared with respect to their effectiveness and the practicality of their application for groups of varying sizes. Reasons why effective detection methods are seldom applied in standardized and classroom testing are discussed. (SLD)
Descriptors: Cheating, Elementary Secondary Education, Evaluation Methods, Higher Education
Peer reviewed Peer reviewed
Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995
The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…
Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests
Peer reviewed Peer reviewed
Way, Walter D.; And Others – Applied Measurement in Education, 1989
The effects of using item response theory (IRT) ability estimates based on customized tests formed by selecting areas from a nationally standardized achievement test were examined. For some populations, in some conditions, IRT ability estimates can be equivalent to scores based on full-length tests. (SLD)
Descriptors: Achievement Tests, Adaptive Testing, Content Validity, Elementary Education
Peer reviewed Peer reviewed
Geisinger, Kurt F. – Applied Measurement in Education, 1994
Federal law requires that individuals with handicapping conditions be administered assessments in ways that accommodate their disabilities without penalizing them. Validation studies are needed to evaluate the meaning of scores resulting from nonstandard test administrations. The limited number of these studies to date is reviewed. (SLD)
Descriptors: Disabilities, Educational Assessment, Elementary School Students, Elementary Secondary Education
Peer reviewed Peer reviewed
Hambleton, Ronald K.; Murphy, Edward – Applied Measurement in Education, 1992
The validity of several criticisms of objective tests is addressed, and the viability of some alternatives to objective testing is discussed. Evidence against multiple-choice tests is not as strong as has been claimed. Authentic assessments may not always be better, and research about new forms of assessment is necessary. (SLD)
Descriptors: Achievement Tests, Educational Assessment, Literature Reviews, Measurement Techniques
Previous Page | Next Page ยป
Pages: 1  |  2