ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	14

Source

Educational Measurement:…

Publication Type

Journal Articles	15
Reports - Descriptive	9
Reports - Evaluative	4
Opinion Papers	1
Reports - Research	1

Education Level

Elementary Secondary Education	4
Higher Education	2
Adult Education	1
Elementary Education	1
Grade 4	1
Intermediate Grades	1
Postsecondary Education	1

Audience

Location

California	1
China	1

Laws, Policies, & Programs

Assessments and Surveys

Progress in International…

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Validation as Evaluating Desired and Undesired Effects: Insights from Cross-Classified Mixed Effects Model

Peer reviewed

Direct link

Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023

The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…

Descriptors: Measurement, Validity, Reliability, Models

Digital Module 05: Diagnostic Measurement--The G-DINA Framework

Peer reviewed

Direct link

Ma, Wenchao; de la Torre, Jimmy – Educational Measurement: Issues and Practice, 2019

In this ITEMS module, we introduce the generalized deterministic inputs, noisy "and" gate (G-DINA) model, which is a general framework for specifying, estimating, and evaluating a wide variety of cognitive diagnosis models. The module contains a nontechnical introduction to diagnostic measurement, an introductory overview of the G-DINA…

Descriptors: Models, Classification, Measurement, Identification

Never Let a Crisis Go to Waste: Large-Scale Assessment and the Response to COVID-19

Peer reviewed

Direct link

Camara, Wayne – Educational Measurement: Issues and Practice, 2020

In early spring 2020 the vast majority of US colleges and schools closed for the year due to COVID-19 with no clear direction on when or how these institutions will reopen for in-person instruction. School closures and the associated health concerns haulted large scale admissions testing and required alternative models such as remote proctoring at…

Descriptors: Measurement, COVID-19, Pandemics, School Closing

Educational Assessment of the Post-Pandemic Age: Chinese Experiences and Trends Based on Large-Scale Online Learning

Peer reviewed

Direct link

Su, Hong – Educational Measurement: Issues and Practice, 2020

Owing to the break-out of the COVID-19 pandemic, students have to take more online learning than offline, and large-scale education assessment programs have to be suspended or postponed. How could education assessment adapt to large-scale online learning? How could the effect and safety of online assessment be improved? What role should formative…

Descriptors: Educational Assessment, Pandemics, COVID-19, Foreign Countries

Designing Knowledge-in-Use Assessments to Promote Deeper Learning

Peer reviewed

Direct link

Harris, Christopher J.; Krajcik, Joseph S.; Pellegrino, James W.; DeBarger, Angela Haydel – Educational Measurement: Issues and Practice, 2019

Contemporary views on learning highlight that deep learning occurs not simply by accumulating knowledge, but by using and applying knowledge as one engages in disciplinary activity. Increasingly, those concerned with education policy and practice are shifting priorities toward supporting deeper learning by emphasizing the importance of students'…

Descriptors: Measurement, Learning Processes, Standards, Science Education

Digital Module 10: Rasch Measurement Theory

Peer reviewed

Direct link

Wang, Jue; Engelhard, George, Jr. – Educational Measurement: Issues and Practice, 2019

In this digital ITEMS module, Dr. Jue Wang and Dr. George Engelhard Jr. describe the Rasch measurement framework for the construction and evaluation of new measures and scales. From a theoretical perspective, they discuss the historical and philosophical perspectives on measurement with a focus on Rasch's concept of specific objectivity and…

Descriptors: Item Response Theory, Evaluation Methods, Measurement, Goodness of Fit

Reliability as Argument

Peer reviewed

Direct link

Parkes, Jay – Educational Measurement: Issues and Practice, 2007

Reliability consists of both important social and scientific values and methods for evidencing those values, though in practice methods are often conflated with the values. With the two distinctly understood, a reliability argument can be made that articulates the particular reliability values most relevant to the particular measurement situation…

Descriptors: Validity, Reliability, Evaluation Methods, Measurement

Moving toward a Comprehensive Assessment System: A Framework for Considering Interim Assessments

Peer reviewed

Direct link

Perie, Marianne; Marion, Scott; Gong, Brian – Educational Measurement: Issues and Practice, 2009

Local assessment systems are being marketed as formative, benchmark, predictive, and a host of other terms. Many so-called formative assessments are not at all similar to the types of assessments and strategies studied by Black and Wiliam (1998) but instead are interim assessments. In this article, we clarify the definition and uses of interim…

Descriptors: Student Evaluation, Evaluation Methods, Educational Assessment, Formative Evaluation

Measurement, Sampling, and Equating Errors in Large-Scale Assessments

Peer reviewed

Direct link

Wu, Margaret – Educational Measurement: Issues and Practice, 2010

In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…

Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness

From Evidence to Action: A Seamless Process in Formative Assessment?

Peer reviewed

Direct link

Heritage, Margaret; Kim, Jinok; Vendlinski, Terry; Herman, Joan – Educational Measurement: Issues and Practice, 2009

Based on the results of a generalizability study of measures of teacher knowledge for teaching mathematics developed at the National Center for Research on Evaluation, Standards, and Student Testing at the University of California, Los Angeles, this article provides evidence that teachers are better at drawing reasonable inferences about student…

Descriptors: Formative Evaluation, Educational Testing, Inferences, Mathematics Instruction

A Framework for Evaluating and Planning Assessments Intended to Improve Student Achievement

Peer reviewed

Direct link

Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009

Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…

Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods

Implications of Evidence-Centered Design for Educational Testing

Peer reviewed

Direct link

Mislevy, Robert J.; Haertel, Geneva D. – Educational Measurement: Issues and Practice, 2006

Evidence-centered assessment design (ECD) provides language, concepts, and knowledge representations for designing and delivering educational assessments, all organized around the evidentiary argument an assessment is meant to embody. This article describes ECD in terms of layers for analyzing domains, laying out arguments, creating schemas for…

Descriptors: Educational Testing, Test Construction, Evaluation Methods, Computer Simulation

An NCME Instructional Module on Estimating Item Response Theory Models Using Markov Chain Monte Carlo Methods

Peer reviewed

Direct link

Kim, Jee-Seon; Bolt, Daniel M. – Educational Measurement: Issues and Practice, 2007

The purpose of this ITEMS module is to provide an introduction to Markov chain Monte Carlo (MCMC) estimation for item response models. A brief description of Bayesian inference is followed by an overview of the various facets of MCMC algorithms, including discussion of prior specification, sampling procedures, and methods for evaluating chain…

Descriptors: Placement, Monte Carlo Methods, Markov Processes, Measurement

Commentary: Evaluating the Validity of Formative and Interim Assessment

Peer reviewed

Direct link

Shepard, Lorrie A. – Educational Measurement: Issues and Practice, 2009

In many school districts, the pressure to raise test scores has created overnight celebrity status for formative assessment. Its powers to raise student achievement have been touted, however, without attending to the research on which these claims were based. Sociocultural learning theory provides theoretical grounding for understanding how…

Descriptors: Learning Theories, Validity, Student Evaluation, Evaluation Methods

Code of Fair Testing Practices in Education (Revised)

Peer reviewed

Direct link

Educational Measurement: Issues and Practice, 2005

A note from the Working Group of the Joint Committee on Testing Practices: The "Code of Fair Testing Practices in Education (Code)" prepared by the Joint Committee on Testing Practices (JCTP) has just been revised for the first time since its initial introduction in 1988. The revision of the Code was inspired primarily by the revision of…

Descriptors: Measurement, Psychological Testing, Test Use, Student Evaluation

Evaluation Methods	15
Measurement	15
Educational Assessment	6
Student Evaluation	6
Educational Testing	5
Formative Evaluation	5
Test Use	5
Educational Principles	4
Evaluation Criteria	4
Evaluation Utilization	4
Test Construction	4
Validity	4
Diagnostic Tests	3
Elementary Secondary Education	3
Evidence	3
Program Evaluation	3
Academic Achievement	2
Best Practices	2
COVID-19	2
Computer Software	2
Educational Improvement	2
Evaluation Research	2
Foreign Countries	2
Goodness of Fit	2
Inferences	2
More ▼

Bolt, Daniel M.	1
Burling, Kelly S.	1
Camara, Wayne	1
DeBarger, Angela Haydel	1
Engelhard, George, Jr.	1
Gong, Brian	1
Haertel, Geneva D.	1
Harris, Christopher J.	1
Heritage, Margaret	1
Herman, Joan	1
Ji, Xuejun Ryan	1
Kim, Jee-Seon	1
Kim, Jinok	1
Krajcik, Joseph S.	1
Ma, Wenchao	1
Marion, Scott	1
Meyers, Jason L.	1
Mislevy, Robert J.	1
Nichols, Paul D.	1
Parkes, Jay	1
Pellegrino, James W.	1
Perie, Marianne	1
Shepard, Lorrie A.	1
Su, Hong	1
Vendlinski, Terry	1
More ▼