ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	8

Descriptor

Error of Measurement	11
Probability	11
Psychometrics	11
Measurement Techniques	5
Testing	4
Item Response Theory	3
Models	3
Evaluation	2
Item Analysis	2
Measurement	2
Reliability	2
Sciences	2
Scores	2
Test Items	2
Academic Achievement	1
Algorithms	1
Auditory Perception	1
Auditory Stimuli	1
Bilingual Education	1
Bilingualism	1
Classification	1
College Students	1
Computer Simulation	1
Computer Software	1
Construct Validity	1
More ▼

Source

Psychometrika	3
Applied Psychological…	1
Educational Research	1
Educational Researcher	1
International Journal of…	1
International Online Journal…	1
Journal of Speech, Language,…	1
Oxford Review of Education	1
Psicologica: International…	1

Author

Andrich, David	1
Bramley, Tom	1
Dirkzwager, Arie	1
Draxler, Clemens	1
Ferrando, Pere J.	1
Hessen, David J.	1
Hutchison, Dougal	1
Nandur, Vuday	1
Sapienza, Christine M.	1
Sekercioglu, Güçlü	1
Shrivastav, Rahul	1
Sijtsma, Klaas	1
Solano-Flores, Guillermo	1
Tijmstra, Jesper	1
Warrens, Matthijs J.	1
van der Heijden, Peter G. M.	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	5
Reports - Evaluative	4
Reports - Descriptive	2

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

United Kingdom (England)

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Measurement Invariance: Concept and Implementation

Peer reviewed
PDF on ERIC

Download full text

Sekercioglu, Güçlü – International Online Journal of Education and Teaching, 2018

An empirical evidence for independent samples of a population regarding measurement invariance implies that factor structure of a measurement tool is equal across these samples; in other words, it measures the intended psychological trait within the same structure. In this case, the evidence of construct validity would be strengthened within the…

Descriptors: Factor Analysis, Error of Measurement, Factor Structure, Construct Validity

Testing Manifest Monotonicity Using Order-Constrained Statistical Inference

Peer reviewed

Direct link

Tijmstra, Jesper; Hessen, David J.; van der Heijden, Peter G. M.; Sijtsma, Klaas – Psychometrika, 2013

Most dichotomous item response models share the assumption of latent monotonicity, which states that the probability of a positive response to an item is a nondecreasing function of a latent variable intended to be measured. Latent monotonicity cannot be evaluated directly, but it implies manifest monotonicity across a variety of observed scores,…

Descriptors: Item Response Theory, Statistical Inference, Probability, Psychometrics

Sample Size Determination for Rasch Model Tests

Peer reviewed

Direct link

Draxler, Clemens – Psychometrika, 2010

This paper is concerned with supplementing statistical tests for the Rasch model so that additionally to the probability of the error of the first kind (Type I probability) the probability of the error of the second kind (Type II probability) can be controlled at a predetermined level by basing the test on the appropriate number of observations.…

Descriptors: Statistical Analysis, Probability, Sample Size, Error of Measurement

Assessing Short-Term Individual Consistency Using IRT-Based Statistics

Peer reviewed
PDF on ERIC

Download full text

Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2010

This article proposes a procedure, based on a global statistic, for assessing intra-individual consistency in a test-retest design with a short-term retest interval. The procedure is developed within the framework of parametric item response theory, and the statistic is a likelihood-based measure that can be considered as an extension of the…

Descriptors: Item Response Theory, Intervals, Psychometrics, Testing

On Association Coefficients for 2x2 Tables and Properties that Do Not Depend on the Marginal Distributions

Peer reviewed

Direct link

Warrens, Matthijs J. – Psychometrika, 2008

We discuss properties that association coefficients may have in general, e.g., zero value under statistical independence, and we examine coefficients for 2x2 tables with respect to these properties. Furthermore, we study a family of coefficients that are linear transformations of the observed proportion of agreement given the marginal…

Descriptors: Probability, Error of Measurement, Psychometrics, Measurement Techniques

A Response to an Article Published in "Educational Research"'s Special Issue on Assessment (June 2009). What Can Be Inferred about Classification Accuracy from Classification Consistency?

Peer reviewed

Direct link

Bramley, Tom – Educational Research, 2010

Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…

Descriptors: National Curriculum, Educational Research, Testing, Measurement

On the Conceptualisation of Measurement Error

Peer reviewed

Direct link

Hutchison, Dougal – Oxford Review of Education, 2008

There is a degree of instability in any measurement, so that if it is repeated, it is possible that a different result may be obtained. Such instability, generally described as "measurement error", may affect the conclusions drawn from an investigation, and methods exist for allowing it. It is less widely known that different disciplines, and…

Descriptors: Measurement Techniques, Data Analysis, Error of Measurement, Test Reliability

Who Is Given Tests in What Language by Whom, When, and Where? The Need for Probabilistic Views of Language in the Testing of English Language Learners

Peer reviewed

Direct link

Solano-Flores, Guillermo – Educational Researcher, 2008

The testing of English language learners (ELLs) is, to a large extent, a random process because of poor implementation and factors that are uncertain or beyond control. Yet current testing practices and policies appear to be based on deterministic views of language and linguistic groups and erroneous assumptions about the capacity of assessment…

Descriptors: Generalizability Theory, Testing, Second Language Learning, Error of Measurement

Application of Psychometric Theory to the Measurement of Voice Quality Using Rating Scales

Peer reviewed

Shrivastav, Rahul; Sapienza, Christine M.; Nandur, Vuday – Journal of Speech, Language, and Hearing Research, 2005

Rating scales are commonly used to study voice quality. However, recent research has demonstrated that perceptual measures of voice quality obtained using rating scales suffer from poor interjudge agreement and reliability, especially in the midrange of the scale. These findings, along with those obtained using multidimensional scaling (MDS), have…

Descriptors: Psychometrics, Probability, Rating Scales, Interrater Reliability

Multiple Evaluation: A New Testing Paradigm that Exorcizes Guessing

Peer reviewed

Direct link

Dirkzwager, Arie – International Journal of Testing, 2003

The crux in psychometrics is how to estimate the probability that a respondent answers an item correctly on one occasion out of many. Under the current testing paradigm this probability is estimated using all kinds of statistical techniques and mathematical modeling. Multiple evaluation is a new testing paradigm using the person's own personal…

Descriptors: Psychometrics, Probability, Models, Measurement

A Probabilistic IRT Model for Unfolding Preference Data.

Peer reviewed

Andrich, David – Applied Psychological Measurement, 1989

A probabilistic item response theory (IRT) model is developed for pair-comparison design in which the unfolding principle governing the choice process uses a discriminant process analogous to Thurstone's Law of Comparative Judgment. A simulation study demonstrates the feasibility of estimation, and two examples illustrate the implications for…

Descriptors: Algorithms, Computer Simulation, Discrimination Learning, Equations (Mathematics)