ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	10
Since 2017 (last 10 years)	22
Since 2007 (last 20 years)	124

Descriptor

Evaluation Methods	178
Psychometrics	178
Educational Assessment	54
Student Evaluation	50
Measurement Techniques	49
Test Construction	49
Testing	47
Measurement	45
Educational Testing	40
Computer Assisted Testing	39
Test Validity	34
Testing Problems	33
Test Items	32
Evaluation Problems	31
Evaluation Research	31
Foreign Countries	30
Models	30
Item Response Theory	29
Comparative Analysis	25
Test Reliability	22
Classification	20
Scores	20
Psychological Testing	18
Test Interpretation	18
Item Analysis	16

Publication Type

Journal Articles	142
Reports - Research	56
Reports - Descriptive	39
Reports - Evaluative	37
Opinion Papers	31
Information Analyses	11
Speeches/Meeting Papers	8
Books	3
Tests/Questionnaires	3
Collected Works - Proceedings	2
Dissertations/Theses -…	2
Guides - General	2
Reports - General	2
Collected Works - General	1
Guides - Classroom - Learner	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1

Education Level

Elementary Secondary Education	35
Higher Education	28
Postsecondary Education	15
Elementary Education	11
Secondary Education	7
Early Childhood Education	5
Adult Education	4
Grade 4	4
Middle Schools	4
Grade 6	3
Grade 5	2
Grade 8	2
High Schools	2
Intermediate Grades	2
Junior High Schools	2
Grade 10	1
Grade 3	1
Grade 7	1
Grade 9	1
Kindergarten	1
Preschool Education	1

Audience

Practitioners	5
Researchers	4
Counselors	2
Students	1

Location

United Kingdom	7
Australia	6
United States	4
Germany	3
United Kingdom (England)	3
Connecticut	2
Florida	2
Massachusetts	2
Netherlands	2
Spain	2
Taiwan	2
United Kingdom (Wales)	2
Canada	1
Colombia	1
Congo	1
Dominica	1
Egypt	1
Grenada	1
India	1
Israel	1
Kentucky	1
Lebanon	1
Malaysia	1
Michigan	1
Mississippi	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	5
Education of the Handicapped…	1
Individuals with Disabilities…	1

Showing 1 to 15 of 178 results

Stopping Rules for Computer Adaptive Testing When Item Banks Have Nonuniform Information

Peer reviewed

Morris, Scott B.; Bass, Michael; Howard, Elizabeth; Neapolitan, Richard E. – International Journal of Testing, 2020

The standard error (SE) stopping rule, which terminates a computer adaptive test (CAT) when the "SE" is less than a threshold, is effective when there are informative questions for all trait levels. However, in domains such as patient-reported outcomes, the items in a bank might all target one end of the trait continuum (e.g., negative…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Banks, Item Response Theory

Using Item Response Models and Analysis to Address Practical Measurement Questions

Weicong Lyu – ProQuest LLC, 2023

Item response theory (IRT) is currently the dominant methodological paradigm in educational and psychological measurement. IRT models are based on assumptions about the relationship between latent traits and observed responses, so the accuracy of the methodology depends heavily on the reasonableness of these assumptions. This dissertation consists…

Descriptors: Item Response Theory, Educational Assessment, Psychological Testing, Psychometrics

Evaluating Methods for Assessing Model Fit in Diagnostic Classification Models

Peer reviewed
PDF on ERIC

W. Jake Thompson – Grantee Submission, 2024

Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…

Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Evidence-Based Assessment in Special Education Research: Advancing the Use of Evidence in Assessment Tools and Empirical Processes

Peer reviewed
PDF on ERIC

Elizabeth Talbott; Andres De Los Reyes; Devin M. Kearns; Jeannette Mancilla-Martinez; Mo Wang – Exceptional Children, 2023

Evidence-based assessment (EBA) requires that investigators employ scientific theories and research findings to guide decisions about what domains to measure, how and when to measure them, and how to make decisions and interpret results. To implement EBA, investigators need high-quality assessment tools along with evidence-based processes. We…

Descriptors: Evidence Based Practice, Evaluation Methods, Special Education, Educational Research

Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021

Peer reviewed

Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023

Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…

Descriptors: Chemistry, Periodicals, Journal Articles, Science Education

The Use of FITNESSGRAM® in PETE: Is It Appropriate?

Peer reviewed

Blackshear, Tara B. – International Journal of Kinesiology in Higher Education, 2022

Many physical education teacher education (PETE) programs have adopted FITNESSGRAM® as the preferred method to assess teacher candidate fitness levels. The rationale, however, is unclear. This article presents fitness testing results of PETE candidates using FITNESSGRAM® with the aim to evaluate its appropriateness. 86 PETE students participated…

Descriptors: Physical Education Teachers, Teacher Education Programs, Physical Fitness, Preservice Teachers

User-Informed Principles: Developing Assessments for All Early Learners. Measures for Early Success. Target Product Profile

Hsueh, JoAnn; Portilla, Ximena; McCormick, Meghan; Balu, Rekha; Najafi, Behnosh – MDRC, 2022

The Measures for Early Success Initiative aims to reimagine the landscape of early learning assessments for the millions of 3- to 5-year-olds enrolled in Pre-K, so that more equitable data can be applied to meaningfully support and strengthen early learning experiences for all young children. This document outlines design parameters for child…

Descriptors: Early Childhood Education, Preschool Children, Student Evaluation, Child Development

BGU-MF: Ben-Gurion University Math Fluency Test

Peer reviewed

Gliksman, Yarden; Berebbi, Shir; Hershman, Ronen; Henik, Avishai – Applied Cognitive Psychology, 2022

Math fluency (MF) is the ability to quickly and accurately solve simple math exercises. Proficiency in MF is one of the building blocks of arithmetic achievement during school. However, so far only paper-and-pencil tests have been used to assess MF. In the current study, we present the BGU-MF (Ben-Gurion University Math Fluency) test, a new computerized…

Descriptors: Foreign Countries, Mathematics Skills, Mathematics Tests, Computer Assisted Testing

A Systematic Review of Emotion Regulation Assessments in US Schools: Bridging the Gap between Researchers and Educators

Peer reviewed

Ng, Zi Jia; Willner, Cynthia J.; Mannweiler, Morgan D.; Hoffmann, Jessica D.; Bailey, Craig S.; Cipriano, Christina – Educational Psychology Review, 2022

Many emotion regulation assessments have been developed for research purposes, but few are frequently used in schools despite the rapid growth of social and emotional learning programs with an explicit focus on emotion regulation in schools. This systematic review provides an overview of emotion regulation assessments that have been utilized with…

Descriptors: Emotional Response, Self Control, Elementary School Students, Secondary School Students

Classroom Assessment and Large-Scale Psychometrics: Shall the Twain Meet? (A Conversation with Margaret Heritage and Neal Kingston)

Peer reviewed

Heritage, Margaret; Kingston, Neal M. – Journal of Educational Measurement, 2019

Classroom assessment and large-scale assessment have, for the most part, existed in mutual isolation. Some experts have felt this is for the best and others have been concerned that the schism limits the potential contribution of both forms of assessment. Margaret Heritage has long been a champion of best practices in classroom assessment. Neal…

Descriptors: Measurement, Psychometrics, Context Effect, Classroom Environment

Development of Procedures to Assess Problem-Solving Competence in Computing Engineering

Peer reviewed

Pérez, Jorge; Vizcarro, Carmen; García, Javier; Bermúdez, Aurelio; Cobos, Ruth – IEEE Transactions on Education, 2017

In the context of higher education, a competence may be understood as the combination of skills, knowledge, attitudes, values, and abilities that underpin effective and/or superior performance in a professional area. The aim of the work reported here was to design a set of procedures to assess a transferable competence, i.e., problem solving, that…

Descriptors: Problem Solving, Computer Science Education, Minimum Competency Testing, Competency Based Education

The Future Value of Serious Games for Assessment: Where Do We Go Now?

Peer reviewed

de Klerk, Sebastiaan; Kato, Pamela M. – Journal of Applied Testing Technology, 2017

Game-based assessments will most likely be an increasing part of testing programs in future generations because they provide promising possibilities for more valid and reliable measurement of students' skills as compared to the traditional methods of assessment like paper-and-pencil tests or performance-based assessments. The current status of…

Descriptors: Futures (of Society), Educational Games, Testing, Educational Benefits

Serious Games for Assessment: Welcome to the Jungle

Peer reviewed

Kato, Pamela M.; de Klerk, Sebastiaan – Journal of Applied Testing Technology, 2017

Serious games are increasingly being explored for use as assessment tools in broad domains. Drawing from research in these domains, we present important advantages and challenges that arise when using games for assessment. In light of this context and as an introduction to this special issue on Serious Games and Assessments, we introduce the…

Descriptors: Evaluation Methods, Formative Evaluation, Design, Educational Games

Source

Measurement:…	24
International Journal of…	12
Journal of Applied Testing…	10
Journal of Educational…	7
Studies in Educational…	4
Educational and Psychological…	3
Applied Psychological…	2
Assessing Writing	2
Chemistry Education Research…	2
Child Abuse & Neglect: The…	2
Early Education and…	2
Educational Assessment	2
Educational Research and…	2
Educational Researcher	2
Educational Technology &…	2
European Journal of…	2
Exceptional Children	2
Grantee Submission	2
Journal of Educational…	2
ProQuest LLC	2
Research Papers in Education	2
American Journal of Evaluation	1
Anatomical Sciences Education	1
Applied Cognitive Psychology	1
Applied Measurement in…	1

Author

Rupp, Andre A.	3
Thurlow, Martha	3
Bielinski, John	2
Cui, Ying	2
Dunne, Michael P.	2
Engelhard, George, Jr.	2
Ferrara, Steve	2
Frey, Andreas	2
Holling, Heinz	2
Jiao, Hong	2
Kato, Pamela M.	2
Minnema, Jane	2
Mislevy, Robert J.	2
Newton, Paul E.	2
Robitzsch, Alexander	2
Runyan, Desmond K.	2
Scissons, Edward H.	2
Wilhelm, Oliver	2
Williamson, David M.	2
Zolotor, Adam J.	2
de Klerk, Sebastiaan	2
Abedi, Jamal	1
Adams, Wendy K.	1

Assessments and Surveys

Advanced Placement…	3
Program for International…	3
SAT (College Admission Test)	3
Florida Comprehensive…	2
Beck Anxiety Inventory	1
Behavior Assessment System…	1
California Achievement Tests	1
Center for Epidemiologic…	1
Cognitive Assessment System	1
Continuous Performance Test	1
Early Childhood Longitudinal…	1
Leiter International…	1
Preliminary Scholastic…	1
Progress in International…	1
Rosenberg Self Esteem Scale	1
Self Description Questionnaire	1
Trends in International…	1
Wechsler Intelligence Scale…	1
Woodcock Johnson Tests of…	1