ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	9

Descriptor

Error of Measurement	14
Models	14
Test Reliability	14
Scores	5
Item Response Theory	4
Measurement Techniques	4
Test Items	4
Test Length	4
Test Validity	4
Correlation	3
Evaluation Methods	3
Item Analysis	3
Measurement	3
Prediction	3
Psychometrics	3
Comparative Analysis	2
Elementary Secondary Education	2
Foreign Countries	2
Sample Size	2
Scoring	2
Simulation	2
Social Science Research	2
Statistical Analysis	2
Student Evaluation	2
Test Interpretation	2

Source

Applied Psychological…	1
Assessment & Evaluation in…	1
ETS Research Report Series	1
Educational Assessment	1
Educational Sciences: Theory…	1
Educational and Psychological…	1
International Journal of…	1
International Journal of…	1
Journal of Education and…	1
Journal of Educational…	1
Measurement:…	1

Publication Type

Journal Articles	10
Reports - Research	6
Reports - Descriptive	3
Reports - Evaluative	2
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	1
Higher Education	1

Location

Germany

Assessments and Surveys

ACT Assessment

Showing all 14 results

Evaluating the Discrepancy between Scale Reliability and Cronbach's Coefficient Alpha Using Latent Variable Modeling

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2023

This article outlines a readily applicable procedure for point and interval estimation of the population discrepancy between reliability and the popular Cronbach's coefficient alpha for unidimensional multi-component measuring instruments with uncorrelated errors, which are widely used in behavioral and social research. The method is developed…

Descriptors: Measurement, Test Reliability, Measurement Techniques, Error of Measurement

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kinds of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

What Was I Thinking? A Theoretical Framework for Analysing Panel Conditioning in Attitudes and (Response) Behaviour

Peer reviewed

Bergmann, Michael; Barth, Alice – International Journal of Social Research Methodology, 2018

Though panel data are increasingly used in the social sciences, the question whether repeatedly participating in a panel survey affects respondents' attitudes and (response) behaviour is still largely unsolved. Drawing on a model of associative networks that is extended by assumptions on survey satisficing, we present a theoretical framework that…

Descriptors: Models, Foreign Countries, Attribution Theory, Prediction

Reliability Estimates for Undergraduate Grade Point Average

Peer reviewed

Westrick, Paul A. – Educational Assessment, 2017

Undergraduate grade point average (GPA) is a commonly employed measure in educational research, serving as a criterion or as a predictor depending on the research question. Over the decades, researchers have used a variety of reliability coefficients to estimate the reliability of undergraduate GPA, which suggests that there has been no consensus…

Descriptors: Undergraduate Students, Test Reliability, College Entrance Examinations, Longitudinal Studies

Examination of Polytomous Items' Psychometric Properties According to Nonparametric Item Response Theory Models in Different Test Conditions

Peer reviewed

Sengul Avsar, Asiye; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2017

This study analysed polytomous items' psychometric properties according to nonparametric item response theory (NIRT) models. Thus, simulated datasets--three different test lengths (10, 20 and 30 items), three sample distributions (normal, right and left skewed) and three sample sizes (100, 250 and 500)--were generated by conducting 20…

Descriptors: Test Items, Psychometrics, Nonparametric Statistics, Item Response Theory

Examination of Different Item Response Theory Models on Tests Composed of Testlets

Peer reviewed

Kogar, Esin Yilmaz; Kelecioglu, Hülya – Journal of Education and Learning, 2017

The purpose of this research is to first estimate the item and ability parameters and the standard error values related to those parameters obtained from Unidimensional Item Response Theory (UIRT), bifactor (BIF) and Testlet Response Theory models (TRT) in the tests including testlets, when the number of testlets, number of independent items, and…

Descriptors: Item Response Theory, Models, Mathematics Tests, Test Items

Multinomial and Compound Multinomial Error Models for Tests with Complex Item Scoring

Peer reviewed

Lee, Won-Chan – Applied Psychological Measurement, 2007

This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…

Descriptors: Simulation, Error of Measurement, Scoring, Test Items

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

Latent Trait Estimation: Theory vs. Practice.

Kolakowski, Donald – 1972

Empirical results are presented as regards the implementation of a latent-trait psychometric model by means of conditional maximum likelihood estimation. Items are scored polychotomously into varying numbers of nominal categories and the test and item characteristic curves and information functions are examined. It is concluded that scoring items…

Descriptors: Error of Measurement, Item Analysis, Item Sampling, Measurement Techniques

Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking

Peer reviewed

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004

The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…

Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items

An Evaluation Model for Mastery Testing

Peer reviewed

Emrick, John A. – Journal of Educational Measurement, 1971

Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, Item Analysis

Simplex Analyses of the Test Data: What Can They Tell Us?

Frayer, Dorothy A. – 1971

A paradigm for testing concept attainment, comprising twelve tasks, was formulated. These tasks were hypothesized to form a cumulative hierarchy. Tests were constructed in mathematics and social studies using the paradigm. Data for these tests were analyzed by Kaiser's method for fitting a perfect simplex and Schonemann's method for fitting a…

Descriptors: Concept Formation, Correlation, Data Analysis, Error of Measurement

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

On Meaningful Measurement: Issues of Reliability and Validity from a Humanistic Constructivist Information-Processing Perspective.

Cheung, K. C. – 1993

In the past decade, there has been ample interest in the assessment of cognitive and affective processes and products for the purposes of meaningful learning. Meaningful measurement (MM), which accords with a humanistic constructivist information-processing perspective, has been proposed. Students' responses to the assessment tasks are…

Descriptors: Affective Behavior, Cognitive Processes, Constructivism (Learning), Educational Assessment

Author

Barth, Alice	1
Bergmann, Michael	1
Burton, Richard F.	1
Cheung, K. C.	1
Ellis, Jules L.	1
Emrick, John A.	1
Foster, Jeff L.	1
Frayer, Dorothy A.	1
Kelecioglu, Hülya	1
Kogar, Esin Yilmaz	1
Kolakowski, Donald	1
Lee, Won-Chan	1
Marcoulides, George A.	1
Meyer, Kevin D.	1
Patsula, Liane	1
Raykov, Tenko	1
Rizavi, Saba	1
Rotou, Ourania	1
Sengul Avsar, Asiye	1
Steffen, Manfred	1
Tavsancil, Ezel	1
Westrick, Paul A.	1