ERIC - Search Results

Publication Date

In 2025	3
Since 2024	13
Since 2021 (last 5 years)	20
Since 2016 (last 10 years)	45
Since 2006 (last 20 years)	81

Descriptor

Error of Measurement	164
Test Construction	164
Test Reliability	61
Test Items	50
Test Validity	41
Item Response Theory	40
Scores	30
Item Analysis	27
Psychometrics	26
Equated Scores	20
Achievement Tests	19
Criterion Referenced Tests	19
Difficulty Level	19
Test Format	18
Computer Assisted Testing	17
Mathematics Tests	17
Statistical Analysis	17
Test Interpretation	17
Measurement Techniques	16
Scoring	16
Testing Problems	16
Foreign Countries	15
Student Evaluation	15
Comparative Analysis	14
Goodness of Fit	14
More ▼

Publication Type

Journal Articles	86
Reports - Research	79
Reports - Descriptive	34
Reports - Evaluative	26
Speeches/Meeting Papers	23
Numerical/Quantitative Data	11
Tests/Questionnaires	7
Dissertations/Theses -…	5
Guides - Non-Classroom	3
Information Analyses	3
Opinion Papers	3
Book/Product Reviews	1
Books	1
Reports - General	1
More ▼

Education Level

Elementary Education	16
Secondary Education	15
Higher Education	13
Postsecondary Education	11
Grade 3	10
Middle Schools	10
Grade 4	9
Junior High Schools	9
Early Childhood Education	8
Grade 5	8
Grade 8	8
Primary Education	8
Elementary Secondary Education	7
Grade 6	7
Grade 7	7
Intermediate Grades	7
Grade 2	5
High Schools	4
Adult Education	1
Grade 1	1
Grade 10	1
Grade 9	1
Kindergarten	1
More ▼

Audience

Researchers	6
Practitioners	1
Students	1
Teachers	1

Location

New York	5
Canada	3
Australia	2
Japan	2
New Mexico	2
Turkey	2
Arkansas	1
Chile	1
Colorado (Boulder)	1
Denmark	1
Ethiopia	1
Florida	1
Italy	1
Maine	1
Mississippi	1
North America	1
Oregon	1
Portugal	1
Virginia	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Race to the Top	1

Assessments and Surveys

National Assessment of…	5
Iowa Tests of Basic Skills	3
Graduate Record Examinations	2
SAT (College Admission Test)	2
ACT Assessment	1
Beck Depression Inventory	1
Center for Epidemiologic…	1
Cognitive Abilities Test	1
Conners Rating Scales	1
Dynamic Indicators of Basic…	1
Graduate Management Admission…	1
Iowa Tests of Educational…	1
MacArthur Communicative…	1
Measures of Academic Progress	1
New Jersey College Basic…	1
Program for International…	1
Rod and Frame Test	1
Test of English for…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 164 results Save | Export

Detecting Differential Item Functioning among Multiple Groups Using IRT Residual DIF Framework

Peer reviewed

Direct link

Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024

This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…

Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Direct link

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

New Developments in Measurement Invariance Testing: An Overview and Comparison of EFA-Based Approaches

Peer reviewed

Direct link

Philipp Sterner; Kim De Roover; David Goretzko – Structural Equation Modeling: A Multidisciplinary Journal, 2025

When comparing relations and means of latent variables, it is important to establish measurement invariance (MI). Most methods to assess MI are based on confirmatory factor analysis (CFA). Recently, new methods have been developed based on exploratory factor analysis (EFA); most notably, as extensions of multi-group EFA, researchers introduced…

Descriptors: Error of Measurement, Measurement Techniques, Factor Analysis, Structural Equation Models

Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights from a Novel Modeling Approach

Peer reviewed

Direct link

Hung-Yu Huang – Educational and Psychological Measurement, 2025

The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…

Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability

Validation of the Higher Education Student Engagement Scale in Use for Program Evaluation

Peer reviewed

Direct link

Stella Y. Kim; Carl Westine; Tong Wu; Derek Maher – Journal of College Student Retention: Research, Theory & Practice, 2024

The primary purpose of this study is to validate a student engagement measure for its use in evaluation of a learning assistant (LA) program. A series of psychometric evaluations were made for both the original scale of Higher Education Student Engagement Scale (HESES) and its adapted version designed to be used in gauging the effectiveness of…

Descriptors: Learner Engagement, Teaching Assistants, Test Validity, Test Reliability

Aiming at Creativity and Ending up with a Range from Low-Hanging Fruits to Foolishness: A Reflective Model of Creativity

Peer reviewed

Direct link

Nicolas Pichot; Boris Forthmann; Eric Bonetto; Thomas Arciszewski; Nathalie Bonnardel; Sara Jaubert; Jean B. Pavani – Journal of Creative Behavior, 2024

The term "creative" is commonly used in everyday language and in academic discourse to discuss the nature of artistic and innovative productions. This usage inherently implies the existence of a variable of creativity that allows different creative works to be compared. The standard definition of creativity asserts that a production must…

Descriptors: Creativity, Test Construction, Test Validity, Productive Thinking

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Addressing Current Methodological Challenges in ILSA's Transition to Adaptive Testing

Direct link

Montserrat Beatriz Valdivia Medinaceli – ProQuest LLC, 2023

My dissertation examines three current challenges of international large-scale assessments (ILSAs) associated with the transition from linear testing to an adaptive testing design. ILSAs are important for making comparisons among populations and informing countries about the quality of their educational systems. ILSA's results inform policymakers…

Descriptors: International Assessment, Achievement Tests, Adaptive Testing, Test Items

Psychometric Validation and Gender Invariance Analysis of a Revised Version of the Attitudes towards Research Scale (EACIN-23) in a Chilean University Student Sample

Peer reviewed

Direct link

G. R. Quintana; I. Dufraix; J. I. Escudero-Pasten; J. F. Santibáñez-Palma; C. Figueroa-Grenett – Cogent Education, 2024

Scientific research is vital for student's education, fostering critical thinking, problem-solving skills, and deepening subject knowledge. To assess students' attitudes towards research, the attitude towards research scale was developed (EACIN). This study addresses three gaps regarding this instrument: inconsistent latent structure, lack of…

Descriptors: Foreign Countries, Undergraduate Students, Psychometrics, Gender Differences

Lagged Dependent Variable Predictors, Classical Measurement Error, and Path Dependency: The Conditions under Which Various Estimators Are Appropriate

Peer reviewed

Direct link

Anders Holm; Anders Hjorth-Trolle; Robert Andersen – Sociological Methods & Research, 2025

Lagged dependent variables (LDVs) are often used as predictors in ordinary least squares (OLS) models in the social sciences. Although several estimators are commonly employed, little is known about their relative merits in the presence of classical measurement error and different longitudinal processes. We assess the performance of four commonly…

Descriptors: Elementary Education, Scores, Error of Measurement, Predictor Variables

Measurement Invariance of the Arabic Version of the Flourishing Scale in a Sample of Special Education Teachers

Peer reviewed

Direct link

AL-Dossary, Saeed A.; Almohayya, Bander M. – Psychology in the Schools, 2024

The present study aims to validate the Flourishing Scale (FS) in a convenience sample of 233 special education teachers. The FS's psychometric properties were investigated using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). EFA had a one-factor solution that explained 49.9% of the variance, a Cronbach's alpha internal…

Descriptors: Error of Measurement, Arabic, Test Construction, Special Education Teachers

The Short Inventory of Creative Activities (S-ICA): Compiling a Short Scale Using Ant Colony Optimization

Peer reviewed

Direct link

D. Steger; S. Weiss; O. Wilhelm – Creativity Research Journal, 2023

Creativity can be measured with a variety of methods including self-reports, others reports, and ability tests. While typical self-reports are best understood as weak proxies of creativity, biographical reports that assess previous creative activities seem more promising. Drawbacks of such measures -- including skewed item distributions, a lack of…

Descriptors: Creativity, Creativity Tests, Test Construction, Algorithms

Development and Psychometric Evaluation of the Open-Source Challenging Behavior Scale (OS-CBS)

Peer reviewed

Direct link

Frazier, Thomas W.; Khaliq, Izma; Scullin, Keeley; Uljarevic, Mirko; Shih, Andy; Karpur, Arun – Journal of Autism and Developmental Disorders, 2023

At present, there are no brief, freely-available, informant-report measures that evaluate key challenging behaviors relevant to youth with autism spectrum disorder (ASD) or other developmental disabilities (DD). This paper describes the development, refinement, and initial psychometric evaluation of a new 18-item measure, the Open-Source…

Descriptors: Test Construction, Psychometrics, Behavior Problems, Autism Spectrum Disorders

Hurdles to Learning Assessment Quality: Their Detrimental Effects on Student Learning

Peer reviewed
PDF on ERIC

Download full text

Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024

The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…

Descriptors: Foreign Countries, College Faculty, College Students, Test Construction

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Direct link

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

Journal of Educational…	11
Applied Measurement in…	9
Educational Measurement:…	7
ETS Research Report Series	6
New York State Education…	5
ProQuest LLC	5
International Journal of…	4
Behavioral Research and…	3
Educational and Psychological…	3
Educational Researcher	2
International Journal of…	2
Journal of Educational and…	2
Journal of Psychoeducational…	2
Measurement and Evaluation in…	2
National Center for Education…	2
New Mexico Public Education…	2
Perceptual and Motor Skills	2
Psychology in the Schools	2
Psychometrika	2
Alberta Journal of…	1
American Institutes for…	1
Assessment & Evaluation in…	1
Assessment for Effective…	1
Biochemistry and Molecular…	1
British Journal of…	1
More ▼

Haladyna, Tom	4
Alonzo, Julie	3
Brennan, Robert L.	3
Hambleton, Ronald K.	3
Livingston, Samuel A.	3
Lord, Frederic M.	3
Roid, Gale	3
Solano-Flores, Guillermo	3
Tindal, Gerald	3
Dever, Jill A.	2
Dorans, Neil J.	2
Erdem, Devrim	2
Fritch, Laura Burns	2
Green, Donald Ross	2
Haladyna, Thomas M.	2
Herget, Deborah R.	2
Ingels, Steven J.	2
Kitmitto, Sami	2
Kolen, Michael J.	2
Leinwand, Steve	2
Liu, Kimy	2
Moses, Tim	2
Ottem, Randolph	2
Patience, Wayne M.	2
More ▼