ERIC - Search Results

Showing 1 to 15 of 30 results

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of the difficulty and discriminating values of binary items and of tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use the entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Examination of Common Exams Held by Measurement and Assessment Centers: Many Facet Rasch Analysis

Peer reviewed

Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021

This paper aims to examine and assess the questions included in the "Turkish Common Exam" for sixth graders held in the first semester of 2018 which is one of the common exams carried out by The Measurement and Evaluation Centers, in terms of question structure, quality and taxonomic value. To this end, the test questions were examined…

Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items

Day Scholars Food Insecurity Experience Scale-Survey Module (DSFIES-SM): Psychometric Analysis

Peer reviewed

Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023

Background: School feeding programs' evaluation requires the measurement of food insecurity, a more objective indicator, within school in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…

Descriptors: Hunger, Food, Program Effectiveness, Psychometrics

Screening Test Items for Differential Item Functioning

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

Coefficient Alpha and Reliability of Scale Scores

Peer reviewed

Almehrizi, Rashid S. – Applied Psychological Measurement, 2013

The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…

Descriptors: Raw Scores, Scaling, Reliability, Computation
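
As context for the abstract above, the raw-score coefficient alpha it mentions can be sketched as follows. This is only the standard textbook formula, not the scale-score extension the article develops. The function name `coefficient_alpha`, the sample data, and the use of sample variances (n - 1 denominator) are illustrative assumptions.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Raw-score coefficient alpha (textbook form).

    scores: list of rows, one row per examinee, each row a list of k item
    scores. Hypothetical helper for illustration, not taken from the article.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across examinees (sample variance, an assumption).
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of the total (summed) raw score.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Two perfectly consistent binary items, four examinees (made-up data).
example = [[1, 1], [0, 0], [1, 1], [0, 0]]
alpha = coefficient_alpha(example)
```

Because the formula is built from raw-score variances, applying it unchanged to nonlinearly transformed scale scores is the gap the abstract points to.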

Conceptual Issues in Response-Time Modeling

Peer reviewed

van der Linden, Wim J. – Journal of Educational Measurement, 2009

Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…

Descriptors: Test Items, Models, Reaction Time, Measurement

Some Notes on the Reinvention of Latent Structure Models as Diagnostic Classification Models

Peer reviewed

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author points out a few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…

Descriptors: Test Items, Probability, Models, Diagnostic Tests

Diagnostic Classification Modeling: Opportunity for Identity

Peer reviewed

Hancock, Gregory R. – Measurement: Interdisciplinary Research and Perspectives, 2009

As Rupp and Templin (2008) stated directly, diagnostic classification methods "are confirmatory in nature." Methods, though, are neither inherently confirmatory nor exploratory. Diagnostic classification modeling, with its analytical and computational obstacles eventually yielding as a comprehensive and potent discipline emerges, will…

Descriptors: Structural Equation Models, Test Items, Models, Diagnostic Tests

Have Cognitive Diagnostic Models Delivered Their Goods? Some Substantial and Methodological Concerns

Peer reviewed

Wilhelm, Oliver; Robitzsch, Alexander – Measurement: Interdisciplinary Research and Perspectives, 2009

The paper by Rupp and Templin (2008) is an excellent work on the characteristics and features of cognitive diagnostic models (CDM). In this article, the authors comment on some substantial and methodological aspects of this focus paper. They organize their comments by going through issues associated with the terms "cognitive,"…

Descriptors: Research Methodology, Test Items, Models, Diagnostic Tests

Diagnostic Classification Models: Which One Should I Use?

Peer reviewed

Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009

Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…

Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability

Classical Test Theory and Item Response Theory: Analytical and Empirical Comparisons.

Hwang, Dae-Yeop – 2002

This study compared classical test theory (CTT) and item response theory (IRT). The behavior of the item and person statistics derived from these two measurement frameworks was examined analytically and empirically using a data set obtained from BILOG (R. Mislevy and D. Bock, 1997). The example was a 15-item test with a sample size of 600…

Descriptors: Comparative Analysis, Measurement Techniques, Scores, Statistical Distributions

Historical Views of Invariance: Evidence from the Measurement Theories of Thorndike, Thurstone, and Rasch.

Peer reviewed

Engelhard, George, Jr. – Educational and Psychological Measurement, 1992

A historical perspective is provided of the concept of invariance in measurement theory, describing sample-invariant item calibration and item-invariant measurement of individuals. Invariance as a key measurement concept is illustrated through the measurement theories of E. L. Thorndike, L. L. Thurstone, and G. Rasch. (SLD)

Descriptors: Behavioral Sciences, Educational History, Measurement Techniques, Psychometrics

On the Direct Measurement of Face Validity: A Comment on Nevo.

Peer reviewed

Secolsky, Charles – Journal of Educational Measurement, 1987

For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)

Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items

Item Bias: Mantel-Haenszel and the Rasch Model. Memorandum No. 39.

Linacre, John M.; Wright, Benjamin D. – 1987

The Mantel-Haenszel (MH) procedure attempts to identify and quantify differential item performance (item bias). This paper summarizes the MH statistics, and identifies the parameters they estimate. An equivalent procedure based on the Rasch model is described. The theoretical properties of the two approaches are compared and shown to require the…

Descriptors: Algorithms, Estimation (Mathematics), Item Analysis, Measurement Techniques
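
For readers unfamiliar with the Mantel-Haenszel statistic named in the abstract above, a minimal sketch of the common odds ratio estimate follows. It uses the standard textbook form and may not match the memorandum's own notation. The function name, the tuple layout, and the counts are illustrative assumptions. Each stratum is one matched ability level (for example, one total-score group).

```python
def mantel_haenszel_odds_ratio(strata):
    """Mantel-Haenszel common odds ratio across matched strata.

    strata: iterable of (a, b, c, d) counts per stratum, where
      a = reference group correct,  b = reference group incorrect,
      c = focal group correct,      d = focal group incorrect.
    Hypothetical helper for illustration only.
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # an empty stratum contributes nothing
        numerator += a * d / n
        denominator += b * c / n
    if denominator == 0:
        raise ValueError("odds ratio undefined: no b*c contribution")
    return numerator / denominator


# Made-up counts: both groups perform identically within each stratum,
# so the estimate equals 1 (no differential item performance).
no_dif = mantel_haenszel_odds_ratio([(10, 10, 10, 10), (20, 5, 20, 5)])
```

A value of 1 indicates that, after matching on ability, the two groups have equal odds of answering the item correctly; values away from 1 suggest differential item performance.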
