ERIC - Search Results

Showing 1 to 15 of 81 results

Examination of the Aggregate Scoring Method in a Judgment Concordance Test

Peer reviewed

Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023

The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…

Descriptors: Scoring, Tests, Evaluation Methods, Test Items

Three Essays on Making Causal Inferences with Test Scores

Sophie Lilit Litschwartz – ProQuest LLC, 2021

In education research test scores are a common object of analysis. Across studies test scores can be an important outcome, a highly predictive covariate, or a means of assigning treatment. However, test scores are a measure of an underlying proficiency we can't observe directly and so contain error. This measurement error has implications for how…

Descriptors: Scores, Inferences, Educational Research, Evaluation Methods

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Modified Item-Fit Indices for Dichotomous IRT Models with Missing Data

Peer reviewed

Xue Zhang; Chun Wang – Grantee Submission, 2022

Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit…

Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length

Dynamic Measurement: A Theoretical-Psychometric Paradigm for Modern Educational Psychology

Peer reviewed

Dumas, Denis; McNeish, Daniel; Greene, Jeffrey A. – Educational Psychologist, 2020

Scholars have lamented that current methods of assessing student performance do not align with contemporary views of learning as situated within students, contexts, and time. Here, we introduce and describe one theoretical-psychometric paradigm, termed "dynamic measurement," designed to provide a valid representation of the way students…

Descriptors: Alternative Assessment, Psychometrics, Educational Psychology, Student Evaluation

Psychometric Properties of a Social and Emotional Competence Assessment for Middle School Students

Peer reviewed

Clark McKown; Nicole Russo-Ponsaran; Matthew Wronski; Ashley Karls – Grantee Submission, 2025

This study describes the rationale, design, development, and technical properties of SELweb MS, a direct assessment of social and emotional competencies in middle school students. Assessment and item design were iteratively developed with input from youth and experts to measure five domains: Self-Awareness, Self-Management, Social Awareness,…

Descriptors: Psychometrics, Social Emotional Learning, Middle School Students, Correlation

What's Creative about Sentences? A Computational Approach to Assessing Creativity in a Sentence Generation Task

Peer reviewed

Weinstein, Theresa J.; Ceh, Simon Majed; Meinel, Christoph; Benedek, Mathias – Creativity Research Journal, 2022

Evaluating creativity of verbal responses or texts is a challenging task due to psychometric issues associated with subjective ratings and the peculiarities of textual data. We explore an approach to objectively assess the creativity of responses in a sentence generation task to (1) better understand what language-related aspects are valued by…

Descriptors: Creativity, Sentences, Natural Language Processing, Computation

Investigation of 2018 ACT Score Declines Final Report

Keng, Leslie; Boyer, Michelle – National Center for the Improvement of Educational Assessment, 2020

ACT requested assistance from the National Center for the Improvement of Educational Assessment (Center for Assessment) to investigate declines of scores for states administering the ACT to its 11th grade students in 2018. This request emerged from conversations among state leaders, the Center for Assessment, and ACT in trying to understand the…

Descriptors: College Entrance Examinations, Scores, Test Score Decline, Educational Trends

Increasing the Consequential Validity of Reading Assessment Using Dynamic Measurement Modeling: A Comment on Dumas and McNeish (2017)

Peer reviewed

Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2018

Dynamic measurement modeling (DMM) has been shown to improve the consequential validity of longitudinal mathematics assessment in the Early Childhood Longitudinal Study-Kindergarten (ECLS-K) database. Here, the authors demonstrate the capability of DMM to similarly improve the consequential validity of ECLS-K reading assessment through the…

Descriptors: Measurement Techniques, Student Evaluation, Alternative Assessment, Evaluation Methods

Extreme Response Style: Which Model Is Best?

Leventhal, Brian – ProQuest LLC, 2017

More robust and rigorous psychometric models, such as multidimensional Item Response Theory models, have been advocated for survey applications. However, item responses may be influenced by construct-irrelevant variance factors such as preferences for extreme response options. Through empirical and simulation methods, this study evaluates the use…

Descriptors: Psychometrics, Item Response Theory, Simulation, Models

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

The Measuring Early Learning Quality & Outcomes Initiative: Purpose, Process and Results

Peer reviewed

Raikes, Abbie; Sayre, Rebecca; Davis, Dawn; Anderson, Kate; Hyson, Marilou; Seminario, Evelyn; Burton, Anna – Early Years: An International Journal of Research and Development, 2019

Measuring Early Learning Quality & Outcomes (MELQO) was initiated to address needs for child development and quality of early childhood education (ECE) data, specifically for low- and middle-income countries. Drawing from existing tools, MELQO convened a consortium to create open-source tools to be adapted to national contexts, simultaneously…

Descriptors: Educational Quality, Outcomes of Education, Child Development, Early Childhood Education

Methods for Examining the Psychometric Quality of Subscores: A Review and Application

Peer reviewed

Wedman, Jonathan; Lyrén, Per-Erik – Practical Assessment, Research & Evaluation, 2015

When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…

Descriptors: Evaluation Methods, Psychometrics, Scores, Tests

The Development, Validity, and Reliability of a Psychometric Instrument Measuring Competencies in Student Affairs

Peer reviewed

Sriram, Rishi – Journal of Student Affairs Research and Practice, 2014

The study of competencies in student affairs began more than 4 decades ago, but no instrument currently exists to measure competencies broadly. This study builds upon previous research by developing an instrument to measure student affairs competencies. Results not only validate the competencies espoused by NASPA and ACPA, but also suggest adding…

Descriptors: Reliability, Psychometrics, Student Personnel Services, Student Personnel Workers

Dynamic Measurement Modeling: Using Nonlinear Growth Models to Estimate Student Learning Capacity

Peer reviewed

Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2017

Single-timepoint educational measurement practices are capable of assessing student ability at the time of testing but are not designed to be informative of student capacity for developing in any particular academic domain, despite commonly being used in such a manner. For this reason, such measurement practice systematically underestimates the…

Descriptors: Measurement Techniques, Student Evaluation, Evaluation Methods, Testing
