ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	10
Since 2007 (last 20 years)	25

Descriptor

Statistical Analysis	37
Test Items	37
Test Construction	13
Item Response Theory	12
Item Analysis	8
Scores	8
Computer Software	7
Educational Assessment	6
Models	6
Test Bias	6
Difficulty Level	5
Evaluation Methods	5
Psychometrics	5
Test Validity	5
Computation	4
Foreign Countries	4
Measurement Techniques	4
Multiple Choice Tests	4
Reliability	4
Student Evaluation	4
Test Reliability	4
Testing	4
Comparative Analysis	3
Computer Assisted Testing	3
Error of Measurement	3
More ▼

Publication Type

Reports - Descriptive	37
Journal Articles	33
Numerical/Quantitative Data	2
Opinion Papers	2
Guides - Non-Classroom	1
Reports - Research	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	7
Postsecondary Education	6
Elementary Education	4
Elementary Secondary Education	4
Grade 8	3
Grade 3	2
Grade 4	2
Grade 5	2
Middle Schools	2
Secondary Education	2
Grade 6	1
Grade 7	1
Junior High Schools	1
More ▼

Audience

Researchers	2
Policymakers	1

Location

Arizona	1
Canada	1
Japan	1
Maine	1
Maryland	1
Turkey	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
Program for International…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Finding the Right Grain-Size for Measurement in the Classroom

Peer reviewed

Direct link

Mark Wilson – Journal of Educational and Behavioral Statistics, 2024

This article introduces a new framework for articulating how educational assessments can be related to teacher uses in the classroom. It articulates three levels of assessment: macro (use of standardized tests), meso (externally developed items), and micro (on-the-fly in the classroom). The first level is the usual context for educational…

Descriptors: Educational Assessment, Measurement, Standardized Tests, Test Items

Designing and Evaluating Tasks to Measure Individual Differences in Experimental Psychology: A Tutorial

Peer reviewed

Direct link

Marc Brysbaert – Cognitive Research: Principles and Implications, 2024

Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…

Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis

An Introduction to Statistical Techniques Used for Detecting Anomaly in Test Results

Peer reviewed

Direct link

He, Qingping; Meadows, Michelle; Black, Beth – Research Papers in Education, 2022

A potential negative consequence of high-stakes testing is inappropriate test behaviour involving individuals and/or institutions. Inappropriate test behaviour and test collusion can result in aberrant response patterns and anomalous test scores and invalidate the intended interpretation and use of test results. A variety of statistical techniques…

Descriptors: Statistical Analysis, High Stakes Tests, Scores, Response Style (Tests)

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Direct link

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Multiple Group Item Response Theory Applications Using "Stata irt" Package

Peer reviewed

Direct link

Zheng, Xiaying; Yang, Ji Seung – Measurement: Interdisciplinary Research and Perspectives, 2021

The purpose of this paper is to briefly introduce two most common applications of multiple group item response theory (IRT) models, namely detecting differential item functioning (DIF) analysis and nonequivalent group score linking with a simultaneous calibration. We illustrate how to conduct those analyses using the "Stata" item…

Descriptors: Item Response Theory, Test Bias, Computer Software, Statistical Analysis

Digital Module 16: Longitudinal Data Analysis

Peer reviewed

Direct link

Harring, Jeffrey R.; Johnson, Tessa L. – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jeffrey Harring and Ms. Tessa Johnson introduce the linear mixed effects (LME) model as a flexible general framework for simultaneously modeling continuous repeated measures data with a scientifically defensible function that adequately summarizes both individual change as well as the average response. The module…

Descriptors: Educational Assessment, Data Analysis, Longitudinal Studies, Case Studies

Misspecification of Attribute Structure in Diagnostic Measurement

Peer reviewed

Direct link

Liu, Ren – Educational and Psychological Measurement, 2018

Attribute structure is an explicit way of presenting the relationship between attributes in diagnostic measurement. The specification of attribute structures directly affects the classification accuracy resulted from psychometric modeling. This study provides a conceptual framework for understanding misspecifications of attribute structures. Under…

Descriptors: Diagnostic Tests, Classification, Test Construction, Relationship

Research on Psychometric Modeling, Analysis, and Reporting of the National Assessment of Educational Progress

Peer reviewed
PDF on ERIC

Download full text

Direct link

Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019

The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…

Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation

Easier Said than Done: Rejoinder on Sijtsma and on Green and Yang

Peer reviewed

Direct link

Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016

The main points of Sijtsma and Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions are at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…

Descriptors: Educational Assessment, Reliability, Validity, Test Construction

Rasch Analysis: A Primer for School Psychology Researchers and Practitioners

Peer reviewed

Direct link

Boone, William J.; Noltemeyer, Amity – Cogent Education, 2017

In order to progress as a field, school psychology research must be informed by effective measurement techniques. One approach to address the need for careful measurement is Rasch analysis. This technique can (a) facilitate the development of instruments that provide useful data, (b) provide data that can be used confidently for both descriptive…

Descriptors: Item Response Theory, School Psychology, School Psychologists, Educational Research

Visual Tools for Eliciting Connections and Cohesiveness in Mixed Methods Research

Peer reviewed

Direct link

Murawska, Jaclyn M.; Walker, David A. – Mid-Western Educational Researcher, 2017

In this commentary, we offer a set of visual tools that can assist education researchers, especially those in the field of mathematics, in developing cohesiveness from a mixed methods perspective, commencing at a study's research questions and literature review, through its data collection and analysis, and finally to its results. This expounds…

Descriptors: Mixed Methods Research, Research Methodology, Visual Aids, Research Tools

Real and Artificial Differential Item Functioning

Peer reviewed

Direct link

Andrich, David; Hagquist, Curt – Journal of Educational and Behavioral Statistics, 2012

The literature in modern test theory on procedures for identifying items with differential item functioning (DIF) among two groups of persons includes the Mantel-Haenszel (MH) procedure. Generally, it is not recognized explicitly that if there is real DIF in some items which favor one group, then as an artifact of this procedure, artificial DIF…

Descriptors: Test Bias, Test Items, Item Response Theory, Statistical Analysis

Analysis of the Difficulty and Discrimination Indices of Multiple-Choice Questions According to Cognitive Levels in an Open and Distance Learning Context

Peer reviewed
PDF on ERIC

Download full text

Koçdar, Serpil; Karadag, Nejdet; Sahin, Murat Dogan – Turkish Online Journal of Educational Technology - TOJET, 2016

This is a descriptive study which intends to determine whether the difficulty and discrimination indices of the multiple-choice questions show differences according to cognitive levels of the Bloom's Taxonomy, which are used in the exams of the courses in a business administration bachelor's degree program offered through open and distance…

Descriptors: Multiple Choice Tests, Difficulty Level, Distance Education, Open Education

Uncertainties in the Item Parameter Estimates and Robust Automated Test Assembly

Peer reviewed

Direct link

Veldkamp, Bernard P.; Matteucci, Mariagiulia; de Jong, Martijn G. – Applied Psychological Measurement, 2013

Item response theory parameters have to be estimated, and because of the estimation process, they do have uncertainty in them. In most large-scale testing programs, the parameters are stored in item banks, and automated test assembly algorithms are applied to assemble operational test forms. These algorithms treat item parameters as fixed values,…

Descriptors: Test Construction, Test Items, Item Banks, Automation

Classical Item Analysis Using Latent Variable Modeling: A Note on a Direct Evaluation Procedure

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Structural Equation Modeling: A Multidisciplinary Journal, 2011

A directly applicable latent variable modeling procedure for classical item analysis is outlined. The method allows one to point and interval estimate item difficulty, item correlations, and item-total correlations for composites consisting of categorical items. The approach is readily employed in empirical research and as a by-product permits…

Descriptors: Item Analysis, Evaluation, Correlation, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3

Journal of Educational and…	5
Educational and Psychological…	3
Psychometrika	3
Applied Psychological…	2
Educational Measurement:…	2
Behavioral Research and…	1
Cogent Education	1
Cognitive Research:…	1
College Entrance Examination…	1
Educational Technology &…	1
International Journal of…	1
International Journal of…	1
Journal of Chemical Education	1
Journal of Educational…	1
Language Assessment Quarterly	1
Marketing Education Review	1
Measurement:…	1
Mid-Western Educational…	1
Optometric Education	1
Physical Review Special…	1
Practical Assessment,…	1
Research Papers in Education	1
Structural Equation Modeling:…	1
Thought Currents in English…	1
Turkish Online Journal of…	1
More ▼

Al-A'ali, Mansoor	1
Al-Sabah, Walid S.	1
Andrich, David	1
Black, Beth	1
Boone, William J.	1
Bradlow, Eric T.	1
Braeken, Johan	1
Burkett, Allan R.	1
Coverdale, Bradley J.	1
Crovo, Mary L.	1
Cziko, Gary A.	1
Davenport, Ernest C.	1
Davison, Mark L.	1
De Boeck, Paul	1
Doran, Harold C.	1
Gierl, Mark J.	1
Hagquist, Curt	1
Harring, Jeffrey R.	1
He, Qingping	1
Hessen, David J.	1
Jin, Ying	1
Joarder, Anwar H.	1
Johnson, Tessa L.	1
Jung, Eunju	1
Karadag, Nejdet	1
More ▼