Raykov, Tenko; Marcoulides, George A.; Pusic, Martin – Measurement: Interdisciplinary Research and Perspectives, 2021
An interval estimation procedure is discussed that can be used to evaluate the probability of a particular response for a binary or binary-scored item at a pre-specified point along an underlying latent continuum. The item is assumed to: (a) be part of a unidimensional multi-component measuring instrument that may also contain polytomous items,…
Descriptors: Item Response Theory, Computation, Probability, Test Items
Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021
Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…
Descriptors: Test Items, Responses, Reaction Time, Models
Altintas, Ozge; Wallin, Gabriel – International Journal of Assessment Tools in Education, 2021
Educational assessment tests are designed to measure the same psychological constructs over extended periods. This feature is important considering that test results are often used for admittance to university programs. To ensure fair assessments, especially for those whose results weigh heavily in selection decisions, it is necessary to collect…
Descriptors: College Admission, College Entrance Examinations, Test Bias, Equated Scores
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2019
When equating two test forms, the equated scores will be biased if the test groups differ in ability. To adjust for the ability imbalance between nonequivalent groups, a set of common items is often used. When no common items are available, it has been suggested to use covariates correlated with the test scores instead. In this article, we reduce…
Descriptors: Equated Scores, Test Items, Probability, College Entrance Examinations
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
Ting, Mu Yu – EURASIA Journal of Mathematics, Science & Technology Education, 2017
Using the capabilities of expert knowledge structures, the researcher prepared test questions on the university calculus topic of "finding the area by integration." The quiz is divided into two types of multiple choice items (one out of four and one out of many). After the calculus course was taught and tested, the results revealed that…
Descriptors: Calculus, Mathematics Instruction, College Mathematics, Multiple Choice Tests
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
Oliveri, Maria Elena; Lawless, Rene; Robin, Frederic; Bridgeman, Brent – Applied Measurement in Education, 2018
We analyzed a pool of items from an admissions test for differential item functioning (DIF) for groups based on age, socioeconomic status, citizenship, or English language status using Mantel-Haenszel and item response theory. DIF items were systematically examined to identify possible sources of DIF by item type, content, and wording. DIF was…
Descriptors: Test Bias, Comparative Analysis, Item Banks, Item Response Theory
Raykov, Tenko; Marcoulides, George A.; Lee, Chun-Lung; Chang, Chi – Educational and Psychological Measurement, 2013
This note is concerned with a latent variable modeling approach for the study of differential item functioning in a multigroup setting. A multiple-testing procedure that can be used to evaluate group differences in response probabilities on individual items is discussed. The method is readily employed when the aim is also to locate possible…
Descriptors: Test Bias, Statistical Analysis, Models, Hypothesis Testing
van der Linden, Wim J.; Jeon, Minjeong – Journal of Educational and Behavioral Statistics, 2012
The probability of test takers changing answers upon review of their initial choices is modeled. The primary purpose of the model is to check erasures on answer sheets recorded by an optical scanner for numbers and patterns that may be indicative of irregular behavior, such as teachers or school administrators changing answer sheets after their…
Descriptors: Probability, Models, Test Items, Educational Testing
Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas – International Journal of Testing, 2015
The purpose of this article is to explore crossing differential item functioning (DIF) in a test drawn from a national examination of mathematics for 11-year-old pupils in England. An empirical dataset was analyzed to explore DIF by gender in a mathematics assessment. A two-step process involving the logistic regression (LR) procedure for…
Descriptors: Mathematics Tests, Gender Differences, Test Bias, Test Items
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
Sadaghiani, Homeyra R.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
As part of an ongoing investigation of students' learning in first semester upper-division quantum mechanics, we needed a high-quality conceptual assessment instrument for comparing outcomes of different curricular approaches. The process of developing such a tool started with converting a preliminary version of a 14-item open-ended quantum…
Descriptors: Science Instruction, Quantum Mechanics, Mechanics (Physics), Multiple Choice Tests
Paek, Insu; Wilson, Mark – Educational and Psychological Measurement, 2011
This study elaborates the Rasch differential item functioning (DIF) model formulation under the marginal maximum likelihood estimation context. Also, the Rasch DIF model performance was examined and compared with the Mantel-Haenszel (MH) procedure in small sample and short test length conditions through simulations. The theoretically known…
Descriptors: Test Bias, Test Length, Statistical Inference, Geometric Concepts