ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	46
Since 2006 (last 20 years)	144

Descriptor

Statistical Analysis	212
Test Construction	47
Evaluation Methods	38
Test Items	37
Foreign Countries	34
Scores	30
Models	26
Test Reliability	25
Test Validity	25
Tests	25
Academic Achievement	23
Computation	22
Comparative Analysis	21
Computer Software	20
Item Response Theory	20
Research Methodology	20
Standardized Tests	20
Higher Education	17
Measurement Techniques	17
Psychometrics	17
Student Evaluation	17
Educational Research	16
Test Bias	16
Data Analysis	15
English (Second Language)	15
More ▼

Publication Type

Reports - Descriptive	212
Journal Articles	158
Speeches/Meeting Papers	12
Guides - Non-Classroom	7
Numerical/Quantitative Data	7
Reports - Research	5
Information Analyses	3
Opinion Papers	3
Books	2
Guides - General	2
Tests/Questionnaires	2
Collected Works - Serial	1
Guides - Classroom - Teacher	1
Reference Materials - General	1
Reports - Evaluative	1
More ▼

Education Level

Higher Education	40
Elementary Secondary Education	23
Postsecondary Education	20
Elementary Education	16
Secondary Education	13
Grade 8	8
High Schools	8
Middle Schools	8
Grade 3	5
Grade 4	5
Grade 5	4
Grade 6	4
Grade 7	4
Junior High Schools	4
Early Childhood Education	3
Grade 1	2
Grade 2	2
Grade 9	2
Kindergarten	2
Primary Education	2
Adult Education	1
Grade 10	1
Intermediate Grades	1
More ▼

Audience

Researchers	6
Practitioners	4
Teachers	4
Policymakers	2

Location

Australia	4
Michigan	4
United Kingdom	4
California	3
New York	3
North Carolina	3
Texas	3
United States	3
Brazil	2
Canada	2
Indiana	2
Japan	2
Missouri	2
Netherlands	2
South Africa	2
United Kingdom (England)	2
Arizona	1
China	1
Denmark	1
European Union	1
France (Paris)	1
Germany	1
Hawaii	1
Hong Kong	1
Hungary	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Individuals with Disabilities…	1
Race to the Top	1

What Works Clearinghouse Rating

Showing 1 to 15 of 212 results Save | Export

An Introduction to Statistical Techniques Used for Detecting Anomaly in Test Results

Peer reviewed

Direct link

He, Qingping; Meadows, Michelle; Black, Beth – Research Papers in Education, 2022

A potential negative consequence of high-stakes testing is inappropriate test behaviour involving individuals and/or institutions. Inappropriate test behaviour and test collusion can result in aberrant response patterns and anomalous test scores and invalidate the intended interpretation and use of test results. A variety of statistical techniques…

Descriptors: Statistical Analysis, High Stakes Tests, Scores, Response Style (Tests)

Finding the Right Grain-Size for Measurement in the Classroom

Peer reviewed

Direct link

Mark Wilson – Journal of Educational and Behavioral Statistics, 2024

This article introduces a new framework for articulating how educational assessments can be related to teacher uses in the classroom. It articulates three levels of assessment: macro (use of standardized tests), meso (externally developed items), and micro (on-the-fly in the classroom). The first level is the usual context for educational…

Descriptors: Educational Assessment, Measurement, Standardized Tests, Test Items

Designing and Evaluating Tasks to Measure Individual Differences in Experimental Psychology: A Tutorial

Peer reviewed

Direct link

Marc Brysbaert – Cognitive Research: Principles and Implications, 2024

Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…

Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis

Multiple Group Item Response Theory Applications Using "Stata irt" Package

Peer reviewed

Direct link

Zheng, Xiaying; Yang, Ji Seung – Measurement: Interdisciplinary Research and Perspectives, 2021

The purpose of this paper is to briefly introduce two most common applications of multiple group item response theory (IRT) models, namely detecting differential item functioning (DIF) analysis and nonequivalent group score linking with a simultaneous calibration. We illustrate how to conduct those analyses using the "Stata" item…

Descriptors: Item Response Theory, Test Bias, Computer Software, Statistical Analysis

A Critical View on the NEAT Equating Design: Statistical Modeling and Identifiability Problems

Peer reviewed

Direct link

San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022

The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design rendering…

Descriptors: Tests, Scores, Statistical Analysis, Models

Detecting Aberrant Response Behavior with Nonparametric Method: Mokken and PerFit Packages in RStudio

Peer reviewed

Direct link

Sengül Avsar, Asiye – Measurement: Interdisciplinary Research and Perspectives, 2020

In order to reach valid and reliable test scores, various test theories have been developed, and one of them is nonparametric item response theory (NIRT). Mokken Models are the most widely known NIRT models which are useful for small samples and short tests. Mokken Package is useful for Mokken Scale Analysis. An important issue about validity is…

Descriptors: Response Style (Tests), Nonparametric Statistics, Item Response Theory, Test Validity

Sample Representativeness. Improving Literacy Brief: Understanding Screening

Direct link

Pentimonti, J.; Petscher, Y.; Stanley, C. – National Center on Improving Literacy, 2019

Sample representativeness is an important piece to consider when evaluating the quality of a screening assessment. If you are trying to determine whether or not the screening tool accurately measures children's skills, you want to ensure that the sample that is used to validate the tool is representative of your population of interest.

Descriptors: Sampling, Screening Tests, Measurement, Test Validity

A Measurement Is a Choice and Stevens' Scales of Measurement Do Not Help Make It: A Response to Chalmers

Peer reviewed

Direct link

Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019

Chalmers recently published a critique of the use of ordinal a[alpha] proposed in Zumbo et al. as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…

Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models

Misspecification of Attribute Structure in Diagnostic Measurement

Peer reviewed

Direct link

Liu, Ren – Educational and Psychological Measurement, 2018

Attribute structure is an explicit way of presenting the relationship between attributes in diagnostic measurement. The specification of attribute structures directly affects the classification accuracy resulted from psychometric modeling. This study provides a conceptual framework for understanding misspecifications of attribute structures. Under…

Descriptors: Diagnostic Tests, Classification, Test Construction, Relationship

Digital Module 16: Longitudinal Data Analysis

Peer reviewed

Direct link

Harring, Jeffrey R.; Johnson, Tessa L. – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jeffrey Harring and Ms. Tessa Johnson introduce the linear mixed effects (LME) model as a flexible general framework for simultaneously modeling continuous repeated measures data with a scientifically defensible function that adequately summarizes both individual change as well as the average response. The module…

Descriptors: Educational Assessment, Data Analysis, Longitudinal Studies, Case Studies

Scientific Research Methodology and Experimental Design in Education and Language Learning Studies: Basics and Guidelines

Download full text

Mahmoud M. S. Abdallah – Online Submission, 2025

This guide offers a comprehensive handbook to scientific research methodology and experimental design, specifically for novice MA and PhD researchers in Education and Language Learning (TESOL/TEFL). It establishes scientific research as a systematic, objective inquiry focused on identifying cause-and-effect relationships through empirical data.…

Descriptors: Scientific Research, Research Methodology, Research Design, Second Language Learning

Research on Psychometric Modeling, Analysis, and Reporting of the National Assessment of Educational Progress

Peer reviewed
PDF on ERIC

Download full text

Direct link

Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019

The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…

Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

Cognitive Diagnostic Assessment: Issues and Considerations

Peer reviewed
PDF on ERIC

Download full text

Javidanmehr, Zahra; Anani Sarab, Mohammad Reza – International Journal of Language Testing, 2017

Cognitive Diagnostic Assessment (CDA) is a type of educational assessment that is designed to measure specific knowledge structures and processing skills in students so as to provide information about their cognitive strengths and weaknesses (Leighton & Gierl, 2007). CDA has been instrumental in turning the attention of practitioners to more…

Descriptors: Cognitive Tests, Diagnostic Tests, Educational Assessment, Second Language Learning

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Direct link

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 15

Journal of Educational and…	10
Educational and Psychological…	7
Psychometrika	7
Educational Measurement:…	6
Structural Equation Modeling:…	6
Applied Psychological…	5
National Center for Education…	4
Advances in Physiology…	3
Educational Evaluation and…	3
International Journal of…	3
Journal of Chemical Education	3
Practical Assessment,…	3
Research Papers in Education	3
Assessment in Education:…	2
EdSource	2
Educational Studies	2
Educational Testing Service	2
Health Education & Behavior	2
IEEE Transactions on Education	2
Journal of College Science…	2
Journal of Educational…	2
Journal of Medical Education	2
Language Assessment Quarterly	2
Language Testing	2
Measurement and Evaluation in…	2
More ▼

Raykov, Tenko	8
Marcoulides, George A.	4
Zumbo, Bruno D.	3
Andrich, David	2
Coe, Robert	2
Doran, Harold C.	2
Drummond, Gordon B.	2
Gierl, Mark J.	2
Rothstein, Richard	2
Vowler, Sarah L.	2
van der Linden, Wim J.	2
Al-A'ali, Mansoor	1
Al-Kattan, Khaled	1
Al-Sabah, Walid S.	1
Aliotta, Marialuisa	1
Allalouf, Avi	1
Allen, Jessica	1
Anani Sarab, Mohammad Reza	1
Archer, Elizabeth	1
Atkinson, Cheryl	1
Bargagliotti, Anna	1
Bartroff, Jay	1
Barua, Rashmi	1
Bason, Mark	1
More ▼

National Assessment of…	6
Program for International…	4
SAT (College Admission Test)	4
ACT Assessment	3
Early Childhood Longitudinal…	3
Comprehensive Tests of Basic…	2
Stanford Achievement Tests	2
Armed Services Vocational…	1
Baccalaureate and Beyond…	1
California Achievement Tests	1
Connecticut Mastery Testing…	1
Dynamic Indicators of Basic…	1
Expressive One Word Picture…	1
Fast Response Survey System	1
General Educational…	1
Graduate Record Examinations	1
International Association for…	1
Iowa Tests of Basic Skills	1
NEO Personality Inventory	1
National Adult Literacy…	1
National Assessment of Adult…	1
National Household Education…	1
National Public Education…	1
Praxis Series	1
Preliminary Scholastic…	1
More ▼