ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	11

Descriptor

Reading Tests	12
Mathematics Tests	7
Grade 4	4
Scores	4
Computer Assisted Testing	3
Grade 3	3
Grade 5	3
Student Evaluation	3
Test Items	3
Writing Tests	3
Achievement Tests	2
Community Colleges	2
Comparative Testing	2
Educational Testing	2
Elementary School Students	2
Error of Measurement	2
Factor Analysis	2
Measurement Techniques	2
Models	2
Reading Achievement	2
Scaling	2
Test Bias	2
Ability	1
Academic Achievement	1
Academic Standards	1
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	12
Reports - Research	8
Reports - Evaluative	3
Information Analyses	1
Reports - Descriptive	1

Education Level

Elementary Education	6
Grade 3	3
Grade 4	3
Grade 5	3
Intermediate Grades	3
Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Grade 6	1
Grade 7	1
High Schools	1
Junior High Schools	1
Middle Schools	1
Two Year Colleges	1
More ▼

Audience

Location

Colorado	1
Kansas	1
Nebraska	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Progress in International…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Validation as Evaluating Desired and Undesired Effects: Insights from Cross-Classified Mixed Effects Model

Peer reviewed

Direct link

Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023

The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…

Descriptors: Measurement, Validity, Reliability, Models

Mode Effects in College Admissions Testing and Differential Speededness as a Possible Explanation

Peer reviewed

Direct link

Steedle, Jeffrey T.; Cho, Young Woo; Wang, Shichao; Arthur, Ann M.; Li, Dongmei – Educational Measurement: Issues and Practice, 2022

As testing programs transition from paper to online testing, they must study mode comparability to support the exchangeability of scores from different testing modes. To that end, a series of three mode comparability studies was conducted during the 2019-2020 academic year with examinees randomly assigned to take the ACT college admissions exam on…

Descriptors: College Entrance Examinations, Computer Assisted Testing, Scores, Test Format

Are Accommodations for English Learners on State Accountability Assessments Evidence-Based? A Multistudy Systematic Review and Meta-Analysis

Peer reviewed

Direct link

Rios, Joseph A.; Ihlenfeldt, Samuel D.; Chavez, Carlos – Educational Measurement: Issues and Practice, 2020

The objectives of this two-part study were to: (a) investigate English learner (EL) accommodation practices on state accountability assessments of reading/English language arts and mathematics in grades 3-8, and (b) conduct a meta-analysis of EL accommodation effectiveness on improving test performance. Across all distinct testing programs, we…

Descriptors: Testing Accommodations, English Language Learners, Program Effectiveness, Evidence Based Practice

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Direct link

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

Automated Scoring of Students' Small-Group Discussions to Assess Reading Ability

Peer reviewed

Direct link

Kosh, Audra E.; Greene, Jeffrey A.; Murphy, P. Karen; Burdick, Hal; Firetto, Carla M.; Elmore, Jeff – Educational Measurement: Issues and Practice, 2018

We explored the feasibility of using automated scoring to assess upper-elementary students' reading ability through analysis of transcripts of students' small-group discussions about texts. Participants included 35 fourth-grade students across two classrooms that engaged in a literacy intervention called Quality Talk. During the course of one…

Descriptors: Computer Assisted Testing, Small Group Instruction, Group Discussion, Student Evaluation

Using State Assessments for Predicting Student Success in Dual-Enrollment College Classes

Peer reviewed

Direct link

Kingston, Neal M.; Anderson, Gretchen – Educational Measurement: Issues and Practice, 2013

Scores on state standards-based assessments are readily available and may be an appropriate alternative to traditional placement tests for assigning or accepting students into particular courses. Many community colleges do not require test scores for admissions purposes but do require some kind of placement scores for first-year English and math…

Descriptors: Dual Enrollment, Student Placement, High School Students, Scores

Grading as a Reform Effort: Do Standards-Based Grades Converge with Test Scores?

Peer reviewed

Direct link

Welsh, Megan E.; D'Agostino, Jerome V.; Kaniskan, Burcu – Educational Measurement: Issues and Practice, 2013

Standards-based progress reports (SBPRs) require teachers to grade students using the performance levels reported by state tests and are an increasingly popular report card format. They may help to increase teacher familiarity with state standards, encourage teachers to exclude nonacademic factors from grades, and/or improve communication with…

Descriptors: Grades (Scholastic), Grading, Report Cards, State Standards

The Impact of Vertical Scaling Decisions on Growth Interpretations

Peer reviewed

Direct link

Briggs, Derek C.; Weeks, Jonathan P. – Educational Measurement: Issues and Practice, 2009

Most growth models implicitly assume that test scores have been vertically scaled. What may not be widely appreciated are the different choices that must be made when creating a vertical score scale. In this paper empirical patterns of growth in student achievement are compared as a function of different approaches to creating a vertical scale.…

Descriptors: Scaling, Models, Longitudinal Studies, Academic Achievement

Differentials of a State Reading Assessment: Item Functioning, Distractor Functioning, and Omission Frequency for Disability Categories

Peer reviewed

Direct link

Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009

Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…

Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior

Effects of Assigning Raters to Items

Peer reviewed

Direct link

Sykes, Robert C.; Ito, Kyoko; Wang, Zhen – Educational Measurement: Issues and Practice, 2008

Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…

Descriptors: Test Items, Mathematics Tests, Reading Tests, Scoring

Determining Sufficient Measurement Opportunities when Using Multiple Cut Scores

Peer reviewed

Direct link

Norman, Rebecca L.; Buckendahl, Chad W. – Educational Measurement: Issues and Practice, 2008

Many educational testing programs report examinee performance at more than two levels of proficiency. Whether these assessments have the capacity to support these multiple inferences, though, is a topic that has not been widely discussed. This study proposes a method for evaluating the minimum number of measurement opportunities for reporting…

Descriptors: Testing Programs, Student Evaluation, Educational Testing, Mathematics Achievement

Computerized Adaptive Testing with Different Groups.

Peer reviewed

Legg, Sue M.; Buhr, Dianne C. – Educational Measurement: Issues and Practice, 1992

Three computerized adaptive tests (CATs) in mathematics, reading, and writing were administered to 628 community college students to determine whether examinees of different ethnic, gender, ability, and age groups, and computer experience were differentially affected. Some differences exist; however, they do not preclude use of CATs. (SLD)

Descriptors: Ability, Adaptive Testing, Age Differences, College Students

Anderson, Gretchen	1
Arthur, Ann M.	1
Briggs, Derek C.	1
Buckendahl, Chad W.	1
Buhr, Dianne C.	1
Burdick, Hal	1
Chavez, Carlos	1
Cho, Young Woo	1
D'Agostino, Jerome V.	1
Elmore, Jeff	1
Firetto, Carla M.	1
Greene, Jeffrey A.	1
Ihlenfeldt, Samuel D.	1
Ito, Kyoko	1
Ji, Xuejun Ryan	1
Kaniskan, Burcu	1
Kato, Kentaro	1
Kingston, Neal M.	1
Kosh, Audra E.	1
Legg, Sue M.	1
Li, Dongmei	1
Moen, Ross E.	1
Moses, Tim	1
Murphy, P. Karen	1
Norman, Rebecca L.	1
More ▼