ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	9

Source

Applied Measurement in…

Publication Type

Journal Articles	9
Reports - Research	6
Reports - Descriptive	2
Reports - Evaluative	1

Education Level

Grade 8	9
Elementary Secondary Education	5
Middle Schools	4
Grade 4	3
Grade 5	3
High Schools	3
Junior High Schools	3
Secondary Education	3
Elementary Education	2
Grade 3	2
Grade 11	1
Grade 12	1
Grade 6	1
Grade 7	1
Grade 9	1
More ▼

Audience

Location

California	1
Canada	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

An Experimental Test of Student Verbal Reports and Teacher Evaluations as a Source of Validity Evidence for Test Development

Peer reviewed

Direct link

Leighton, Jacqueline P.; Heffernan, Colleen; Cor, M. Kenneth; Gokiert, Rebecca J.; Cui, Ying – Applied Measurement in Education, 2011

The "Standards for Educational and Psychological Testing" indicate that test instructions, and by extension item objectives, presented to examinees should be sufficiently clear and detailed to help ensure that they respond as developers intend them to respond (Standard 3.20; AERA, APA, & NCME, 1999). The present study investigates…

Descriptors: Test Construction, Validity, Evidence, Science Tests

Measurement Properties of Two Innovative Item Formats in a Computer-Based Test

Peer reviewed

Direct link

Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012

Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement

Accessible Reading Assessments for Students with Disabilities: Summary and Conclusions

Peer reviewed

Direct link

Wise, Lauress L. – Applied Measurement in Education, 2010

The articles in this special issue make two important contributions to our understanding of the impact of accommodations on test score validity. First, they illustrate a variety of methods for collection and rigorous analyses of empirical data that can supplant expert judgment of the impact of accommodations. These methods range from internal…

Descriptors: Reading Achievement, Educational Assessment, Test Reliability, Learning Disabilities

The Effects of Glossary and Read-Aloud Accommodations on English Language Learners' Performance on a Mathematics Assessment

Peer reviewed

Direct link

Wolf, Mikyung Kim; Kim, Jinok; Kao, Jenny – Applied Measurement in Education, 2012

Glossary and reading aloud test items are commonly allowed in many states' accommodation policies for English language learner (ELL) students for large-scale mathematics assessments. However, little research is available regarding the effects of these accommodations on ELL students' performance. Further, no research exists that examines how…

Descriptors: Testing Accommodations, Glossaries, Reading Aloud to Others, Validity

Accessibility of Segmented Reading Comprehension Passages for Students with Disabilities

Peer reviewed

Direct link

Abedi, Jamal; Kao, Jenny C.; Leon, Seth; Mastergeorge, Ann M.; Sullivan, Lisa; Herman, Joan; Pope, Rita – Applied Measurement in Education, 2010

This study explores factors that affect the accessibility of reading comprehension assessments for students with disabilities in grade 8 public school classrooms. The study consisted of assessing students using reading comprehension passages that were broken down into shorter "segments" or "chunks" in order to assess the…

Descriptors: Reading Achievement, Educational Strategies, Recall (Psychology), Reading Comprehension

Examining the Effectiveness of Test Accommodation Using DIF and a Mixture IRT Model

Peer reviewed

Direct link

Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012

This study examined the validity of test accommodation in third-eighth graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…

Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity

Prologue: An Introduction to the Evaluation of NAEP

Peer reviewed

Direct link

Lane, Suzanne; Zumbo, Bruno D.; Abedi, Jamal; Benson, Jeri; Dossey, John; Elliott, Stephen N.; Kane, Michael; Linn, Robert; Paredes-Ziker, Cindy; Rodriguez, Michael; Schraw, Gregg; Slattery, Jean; Thomas, Veronica; Willhoft, Joe – Applied Measurement in Education, 2009

Given the changing landscape of educational accountability at the local, state, and national levels, and the changes in the uses of the National Assessment of Educational Progress (NAEP), including the evolving uses of NAEP as a policy tool to interpret state assessment and accountability systems, an explicit statement of the current and potential…

Descriptors: National Competency Tests, Academic Achievement, Accountability, Test Validity

Application of Generalizability Theory to Concept Map Assessment Research

Peer reviewed

Direct link

Yin, Yue; Shavelson, Richard J. – Applied Measurement in Education, 2008

In the first part of this article, the use of Generalizability (G) theory in examining the dependability of concept map assessment scores and designing a concept map assessment for a particular practical application is discussed. In the second part, the application of G theory is demonstrated by comparing the technical qualities of two frequently…

Descriptors: Generalizability Theory, Concept Mapping, Validity, Reliability

The Critical Role of Anchor Paper Selection in Writing Assessment

Peer reviewed

Direct link

Osborn Popp, Sharon E.; Ryan, Joseph M.; Thompson, Marilyn S. – Applied Measurement in Education, 2009

Scoring rubrics are routinely used to evaluate the quality of writing samples produced for writing performance assessments, with anchor papers chosen to represent score points defined in the rubric. Although the careful selection of anchor papers is associated with best practices for scoring, little research has been conducted on the role of…

Descriptors: Writing Evaluation, Scoring Rubrics, Selection, Scoring

Validity	7
Grade 8	5
Test Items	4
Academic Accommodations…	2
Difficulty Level	2
Disabilities	2
Educational Assessment	2
Grade 3	2
Grade 5	2
Item Response Theory	2
Mathematics Tests	2
Reading Achievement	2
Reliability	2
Science Tests	2
Scores	2
Test Format	2
Test Reliability	2
Test Validity	2
Testing Accommodations	2
Academic Ability	1
Academic Achievement	1
Accountability	1
Achievement Tests	1
Benchmarking	1
Classrooms	1
More ▼

Abedi, Jamal	2
Benson, Jeri	1
Cho, Hyun-Jeong	1
Cor, M. Kenneth	1
Cui, Ying	1
Dossey, John	1
Elliott, Stephen N.	1
Gokiert, Rebecca J.	1
Heffernan, Colleen	1
Henly, George A.	1
Herman, Joan	1
Kane, Michael	1
Kao, Jenny	1
Kao, Jenny C.	1
Kim, Jinok	1
Kingston, Neal	1
Lane, Suzanne	1
Lee, Jaehoon	1
Leighton, Jacqueline P.	1
Leon, Seth	1
Linn, Robert	1
Mastergeorge, Ann M.	1
Osborn Popp, Sharon E.	1
Paredes-Ziker, Cindy	1
Pope, Rita	1
More ▼