ERIC - Search Results

Publication Date

In 2025	2
Since 2024	6
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	22
Since 2006 (last 20 years)	33

Descriptor

Error of Measurement	61
Test Construction	61
Test Reliability	61
Test Validity	26
Item Response Theory	14
Criterion Referenced Tests	13
Item Analysis	13
Test Items	13
Test Interpretation	10
Foreign Countries	9
Scores	9
Scoring	9
Norm Referenced Tests	8
Psychometrics	8
Test Bias	8
Achievement Tests	7
English	7
Test Theory	7
Testing	7
Academic Achievement	6
Computer Assisted Testing	6
Data Collection	6
Grade 5	6
Grade 7	6
Grade 8	6
More ▼

Publication Type

Reports - Research	32
Journal Articles	27
Reports - Descriptive	13
Speeches/Meeting Papers	9
Numerical/Quantitative Data	7
Reports - Evaluative	6
Tests/Questionnaires	3
Guides - Non-Classroom	1
Information Analyses	1
Opinion Papers	1

Education Level

Elementary Education	9
Higher Education	9
Postsecondary Education	7
Secondary Education	7
Early Childhood Education	6
Elementary Secondary Education	6
Grade 5	6
Grade 8	6
Intermediate Grades	6
Junior High Schools	6
Middle Schools	6
Primary Education	6
Grade 3	5
Grade 4	5
Grade 6	5
Grade 7	5
Grade 10	1
High Schools	1
More ▼

Audience

Researchers	2
Teachers	1

Location

New York	5
Canada	2
New Mexico	2
Turkey	2
Australia	1
Denmark	1
Ethiopia	1
Florida	1
Italy	1
Japan	1
Maine	1
Mississippi	1
North America	1
Virginia	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Iowa Tests of Basic Skills	2
Beck Depression Inventory	1
Cognitive Abilities Test	1
Conners Rating Scales	1
Iowa Tests of Educational…	1
MacArthur Communicative…	1
National Assessment of…	1
New Jersey College Basic…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 61 results Save | Export

Detecting Differential Item Functioning among Multiple Groups Using IRT Residual DIF Framework

Peer reviewed

Direct link

Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024

This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…

Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction

Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights from a Novel Modeling Approach

Peer reviewed

Direct link

Hung-Yu Huang – Educational and Psychological Measurement, 2025

The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…

Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability

Validation of the Higher Education Student Engagement Scale in Use for Program Evaluation

Peer reviewed

Direct link

Stella Y. Kim; Carl Westine; Tong Wu; Derek Maher – Journal of College Student Retention: Research, Theory & Practice, 2024

The primary purpose of this study is to validate a student engagement measure for its use in evaluation of a learning assistant (LA) program. A series of psychometric evaluations were made for both the original scale of Higher Education Student Engagement Scale (HESES) and its adapted version designed to be used in gauging the effectiveness of…

Descriptors: Learner Engagement, Teaching Assistants, Test Validity, Test Reliability

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Lagged Dependent Variable Predictors, Classical Measurement Error, and Path Dependency: The Conditions under Which Various Estimators Are Appropriate

Peer reviewed

Direct link

Anders Holm; Anders Hjorth-Trolle; Robert Andersen – Sociological Methods & Research, 2025

Lagged dependent variables (LDVs) are often used as predictors in ordinary least squares (OLS) models in the social sciences. Although several estimators are commonly employed, little is known about their relative merits in the presence of classical measurement error and different longitudinal processes. We assess the performance of four commonly…

Descriptors: Elementary Education, Scores, Error of Measurement, Predictor Variables

The Short Inventory of Creative Activities (S-ICA): Compiling a Short Scale Using Ant Colony Optimization

Peer reviewed

Direct link

D. Steger; S. Weiss; O. Wilhelm – Creativity Research Journal, 2023

Creativity can be measured with a variety of methods including self-reports, others reports, and ability tests. While typical self-reports are best understood as weak proxies of creativity, biographical reports that assess previous creative activities seem more promising. Drawbacks of such measures -- including skewed item distributions, a lack of…

Descriptors: Creativity, Creativity Tests, Test Construction, Algorithms

Development and Psychometric Evaluation of the Open-Source Challenging Behavior Scale (OS-CBS)

Peer reviewed

Direct link

Frazier, Thomas W.; Khaliq, Izma; Scullin, Keeley; Uljarevic, Mirko; Shih, Andy; Karpur, Arun – Journal of Autism and Developmental Disorders, 2023

At present, there are no brief, freely-available, informant-report measures that evaluate key challenging behaviors relevant to youth with autism spectrum disorder (ASD) or other developmental disabilities (DD). This paper describes the development, refinement, and initial psychometric evaluation of a new 18-item measure, the Open-Source…

Descriptors: Test Construction, Psychometrics, Behavior Problems, Autism Spectrum Disorders

Hurdles to Learning Assessment Quality: Their Detrimental Effects on Student Learning

Peer reviewed
PDF on ERIC

Download full text

Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024

The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…

Descriptors: Foreign Countries, College Faculty, College Students, Test Construction

The Invariance Paradox: Using Optimal Test Design to Minimize Bias

Peer reviewed

Direct link

Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020

Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…

Descriptors: Test Construction, Test Bias, Classification, Accuracy

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Perceived Teacher Informal Relationship Scale: A Scale Development and Measurement Invariance Study

Peer reviewed
PDF on ERIC

Download full text

Demirtas, Zülfü; Çaçan, Hanifi; Uslukaya, Alper – International Journal of Contemporary Educational Research, 2023

This work is intended to develop a measuring tool for determining teacher perception of informal relationships. The pool of items created by researchers through a literature review has been presented with expert assessment of the validity of the content, face, and meaning, and a draft scale has been created by making necessary revisions to the…

Descriptors: Foreign Countries, Teacher Attitudes, Likert Scales, Test Construction

Charting the Future of Assessments. Full Report

Download full text

Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international largescale assessments of cognitive and…

Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias

The Italian Version of the State-Trait Cheerfulness Inventory Trait Form: Psychometric Validation and Evaluation of Measurement Invariance

Peer reviewed

Direct link

Lau, Chloe; Chiesi, Francesca; Hofmann, Jennifer; Ruch, Willibald; Saklofske, Donald H. – Journal of Psychoeducational Assessment, 2020

The State-Trait Cheerfulness Inventory--Trait Version (STCI-T60) measures the temperamental basis of sense of humor involving theoretically derived personality dispositions of cheerfulness, seriousness, and bad mood. The reliability and validity of the newly developed STCI-T60 Italian version were assessed in a sample of Italian speakers (N =…

Descriptors: Foreign Countries, Personality Traits, Psychometrics, Test Construction

Multicultural Competence Scale for Prospective Teachers: Development, Validation and Measurement Invariance

Peer reviewed
PDF on ERIC

Download full text

Erdem, Devrim – Eurasian Journal of Educational Research, 2020

Purpose: This study reports on the development, validation and measurement invariance of the Multicultural Competency Scale (MCS) for pre-service teachers. Research Methods: Data from 640 pre-service teachers were collected for two studies. After data screening procedures 628 responses were left. The data were divided into two sets for exploratory…

Descriptors: Rating Scales, Multicultural Education, Cultural Awareness, Teacher Competencies

The Young Adults Form of the Attitude toward Women's Working Scale: Development, Preliminary Validation and Measurement Invariance

Peer reviewed
PDF on ERIC

Download full text

Erdem, Devrim – International Journal of Assessment Tools in Education, 2020

The purpose of this study was to develop a scale measuring attitudes toward women's working. In line with this main purpose, two studies were conducted to develop the tool and investigate its psychometric properties in two different samples. The study 1 started with generating item pool, conducting exploratory factor analysis to identify…

Descriptors: Young Adults, Employed Women, Test Construction, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Journal of Educational…	5
New York State Education…	5
Educational Measurement:…	3
Journal of Psychoeducational…	2
New Mexico Public Education…	2
Applied Measurement in…	1
Biochemistry and Molecular…	1
Clinical Linguistics &…	1
Creativity Research Journal	1
ETS Research Institute	1
Educ Psychol Meas	1
Education and Information…	1
Educational and Psychological…	1
Eurasian Journal of…	1
Grantee Submission	1
IEEE Transactions on Education	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Intersection: A Journal at…	1
Journal of Agronomic…	1
Journal of Autism and…	1
Journal of College Student…	1
Journal of General Education	1
More ▼

Brennan, Robert L.	3
Haladyna, Tom	3
Erdem, Devrim	2
Livingston, Samuel A.	2
Patience, Wayne M.	2
Reckase, Mark D.	2
Roid, Gale	2
Amit Sevak	1
Anders Hjorth-Trolle	1
Anders Holm	1
Anderson, Trevor R.	1
Bardhoshi, Gerta	1
Bateman, Andrea	1
Benson, Jeri	1
Bichi, Ado Abdu	1
Bleses, Dorthe	1
Bray, Wendy S.	1
Bridgeman, Brent	1
Bristow, M.	1
Cameron, Lynn	1
Cantor, Nancy K.	1
Carl Westine	1
Chambers, William V.	1
Chiesi, Francesca	1
More ▼