Showing all 10 results
Peer reviewed
Direct link
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
To ensure content validity by covering a broad range of content domains, some educational large-scale assessments have total testing times of two hours or more. Performance decline over the course of the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Peer reviewed
Direct link
Sinharay, Sandip – Educational and Psychological Measurement, 2022
Administrative problems such as computer malfunctions and power outages occasionally lead to missing item scores, and hence to incomplete data, on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probability of passing for examinees with incomplete data on mastery tests.…
Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness
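The snippet above does not spell out Sinharay's estimator, but the core computation such methods build on can be sketched. Assuming a fitted Rasch model and an ability estimate for the examinee, the Lord-Wingersky recursion gives the score distribution over the unanswered items, and hence the probability of reaching the cutoff. All parameter values below are hypothetical.

```python
import math

def rasch_p(theta, b):
    """Probability of a correct response under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def score_distribution(theta, difficulties):
    """Lord-Wingersky recursion: distribution of the number-correct
    score over the given items at ability theta."""
    dist = [1.0]  # P(score = 0) over zero items
    for b in difficulties:
        p = rasch_p(theta, b)
        new = [0.0] * (len(dist) + 1)
        for s, prob in enumerate(dist):
            new[s] += prob * (1 - p)   # item answered incorrectly
            new[s + 1] += prob * p     # item answered correctly
        dist = new
    return dist

def passing_probability(theta, observed_score, missing_difficulties, cutoff):
    """P(total score >= cutoff), given the score on completed items and
    the score distribution over the missing items."""
    needed = max(0, cutoff - observed_score)
    dist = score_distribution(theta, missing_difficulties)
    return sum(prob for s, prob in enumerate(dist) if s >= needed)

# Illustration with made-up numbers: 3 items lost to a computer
# malfunction; the examinee needs 2 of them to reach the cutoff.
p_pass = passing_probability(theta=0.5, observed_score=18,
                             missing_difficulties=[-0.5, 0.0, 1.0],
                             cutoff=20)
print(round(p_pass, 3))
```

The recursion needs only the missing items' difficulties, so the same machinery handles any pattern of incompleteness.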
Peer reviewed
PDF on ERIC Download full text
Haberman, Shelby J.; Lee, Yi-Hsuan – ETS Research Report Series, 2017
In investigations of unusual testing behavior, a common question is whether a specific pattern of responses occurs unusually often within a group of examinees. In many current tests, modern communication techniques can permit quite large numbers of examinees to share keys, or common response patterns, to the entire test. To address this issue,…
Descriptors: Student Evaluation, Testing, Item Response Theory, Maximum Likelihood Statistics
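The abstract asks whether a specific response pattern occurs unusually often in a group. A simple baseline check (a sketch, not Haberman and Lee's actual statistic) is: under local independence, the probability of any full pattern is the product of per-item response probabilities, so the observed count of a pattern among N examinees can be compared against a Binomial(N, p) reference. All numbers below are made up.

```python
import math

def pattern_probability(pattern, p_correct):
    """P(exact 0/1 response pattern) under local independence,
    given each item's probability of a correct response."""
    prob = 1.0
    for resp, p in zip(pattern, p_correct):
        prob *= p if resp == 1 else (1.0 - p)
    return prob

def binomial_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p), computed as 1 - P(X <= k-1)
    so it stays cheap and numerically safe for small k."""
    lower = sum(math.comb(n, i) * p**i * (1.0 - p)**(n - i) for i in range(k))
    return max(0.0, 1.0 - lower)

# Hypothetical 20-item pattern shared by 5 of 2000 examinees.
p = pattern_probability([1, 0] * 10, [0.5] * 20)
tail = binomial_tail(2000, 5, p)
print(p, tail)  # a tiny tail probability flags the sharing as unusual
```

With realistic test lengths any specific full-test pattern is rare, which is why multiple examinees sharing one is informative.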
Peer reviewed
Direct link
An, Chen; Braun, Henry; Walsh, Mary E. – Educational Measurement: Issues and Practice, 2018
Making causal inferences from a quasi-experiment is difficult. Sensitivity analysis approaches that address hidden selection bias have thus gained popularity. This study serves as an introduction to a simple but practical form of sensitivity analysis using Monte Carlo simulation procedures. We examine estimated treatment effects for a school-based…
Descriptors: Statistical Inference, Intervention, Program Effectiveness, Quasiexperimental Design
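As an illustration of the kind of Monte Carlo sensitivity analysis the abstract describes (a generic sketch, not the authors' exact procedure), one can ask how strong an unobserved confounder would have to be to explain away a naive treatment-effect estimate. The data, prevalences, and confounder effect `gamma` below are all invented.

```python
import random
import statistics

random.seed(7)

# Invented quasi-experimental outcomes for a treated and an
# untreated group; the naive mean difference may be confounded.
treated = [random.gauss(1.0, 1.0) for _ in range(200)]
untreated = [random.gauss(0.0, 1.0) for _ in range(200)]
naive = statistics.mean(treated) - statistics.mean(untreated)

def adjusted_effect(p_u_treated, p_u_untreated, gamma, n_draws=500):
    """Monte Carlo sensitivity check: repeatedly impute a hypothetical
    binary confounder U with the assumed group prevalences, and apply a
    simple bias correction (gamma is U's assumed effect on the outcome)."""
    estimates = []
    for _ in range(n_draws):
        u_t = sum(random.random() < p_u_treated for _ in treated) / len(treated)
        u_c = sum(random.random() < p_u_untreated for _ in untreated) / len(untreated)
        estimates.append(naive - gamma * (u_t - u_c))
    return statistics.mean(estimates), statistics.stdev(estimates)

# How much hidden bias would it take to wipe out the naive effect?
for gamma in (0.0, 0.5, 1.0, 2.0):
    est, sd = adjusted_effect(0.7, 0.3, gamma)
    print(f"gamma={gamma:.1f}: adjusted effect {est:.2f} (MC sd {sd:.3f})")
```

The adjusted estimate shrinks as the assumed confounding strengthens; the value of `gamma` at which it crosses zero summarizes robustness to hidden bias.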
Peer reviewed
Direct link
Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014
With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)
Peer reviewed
Direct link
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for, they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
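The mechanics behind the abstract's warning can be illustrated with the standard Kish design-effect approximation for cluster samples (a textbook sketch, not necessarily Phillips's exact computation); the cluster size and intraclass correlation below are invented.

```python
import math

def design_effect(avg_cluster_size, icc):
    """Kish approximation: deff = 1 + (m - 1) * rho, where m is the
    average cluster (e.g. classroom) size and rho the intraclass
    correlation of the outcome."""
    return 1.0 + (avg_cluster_size - 1.0) * icc

def corrected_se(srs_se, avg_cluster_size, icc):
    """Standard error after accounting for clustering: a simple-random-
    sampling SE understates uncertainty by a factor of sqrt(deff)."""
    return srs_se * math.sqrt(design_effect(avg_cluster_size, icc))

# Hypothetical state assessment: 25 students per classroom, ICC 0.2.
deff = design_effect(25, 0.2)  # 1 + 24 * 0.2 = 5.8
print(deff, corrected_se(0.5, 25, 0.2))
```

A design effect of 5.8 means the clustered sample carries the information of a simple random sample less than a fifth its size, which is exactly the underestimation the abstract warns about.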
Peer reviewed
Direct link
Chen, Yuguo; Small, Dylan – Psychometrika, 2005
Rasch proposed an exact conditional inference approach to testing his model but never implemented it because it involves the calculation of a complicated probability. This paper furthers Rasch's approach by (1) providing an efficient Monte Carlo methodology for accurately approximating the required probability and (2) illustrating the usefulness…
Descriptors: Testing Problems, Probability, Methods, Testing
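Chen and Small's efficient methodology is sequential importance sampling; as a simpler illustration of the same idea of conditional Monte Carlo inference for the Rasch model, the sketch below samples 0-1 response matrices with the observed margins (the model's sufficient statistics) preserved, using checkerboard swaps, and compares a Guttman-error statistic against the sampled reference distribution. The data matrix is invented, and burn-in/thinning issues are glossed over.

```python
import random

random.seed(3)

def checkerboard_swap(matrix, steps=1000):
    """One MCMC draw: repeatedly pick a random 2x2 submatrix and, when it
    is a checkerboard ([[1,0],[0,1]] or [[0,1],[1,0]]), flip it. Swaps
    preserve all row and column sums; the chain's stationary distribution
    is uniform over matrices with those margins."""
    m = [row[:] for row in matrix]
    for _ in range(steps):
        r1, r2 = random.sample(range(len(m)), 2)
        c1, c2 = random.sample(range(len(m[0])), 2)
        a, b = m[r1][c1], m[r1][c2]
        c, d = m[r2][c1], m[r2][c2]
        if a == d and b == c and a != b:
            m[r1][c1], m[r1][c2] = b, a
            m[r2][c1], m[r2][c2] = d, c
    return m

def guttman_errors(m):
    """Count of (easier item wrong, harder item right) pairs per person,
    a classic misfit statistic for the Rasch model."""
    order = sorted(range(len(m[0])), key=lambda j: -sum(row[j] for row in m))
    errors = 0
    for row in m:
        resp = [row[j] for j in order]  # easiest first
        for i in range(len(resp)):
            for j in range(i + 1, len(resp)):
                if resp[i] == 0 and resp[j] == 1:
                    errors += 1
    return errors

# Invented 8-person, 5-item response matrix.
data = [[1, 1, 1, 0, 0], [1, 1, 0, 1, 0], [1, 0, 1, 0, 0], [1, 1, 1, 1, 1],
        [0, 1, 0, 0, 0], [1, 1, 0, 0, 1], [1, 0, 1, 1, 0], [0, 1, 1, 0, 0]]
observed = guttman_errors(data)
draws = [guttman_errors(checkerboard_swap(data)) for _ in range(100)]
p_value = (1 + sum(d >= observed for d in draws)) / (1 + len(draws))
print(observed, p_value)
```

Conditioning on the margins removes the person and item parameters entirely, which is what makes the test "exact" in Rasch's sense; the Monte Carlo step only approximates the conditional p-value.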
Peer reviewed
Wilcox, Rand R. – Educational and Psychological Measurement, 1979
A problem of considerable importance in certain educational settings is determining how many items to include on a mastery test. Applying ranking and selection procedures, a solution is given which includes as a special case all existing single-stage, non-Bayesian solutions based on a strong true-score model. (Author/JKS)
Descriptors: Criterion Referenced Tests, Mastery Tests, Nonparametric Statistics, Probability
Peer reviewed
Green, D. R.; Tomlinson, M. – Journal of Research in Reading, 1983
Confirms that in cloze testing, it is unnecessary to use standard size spaces and reveals a high correlation between synonymic scoring and verbatim scoring. Indicates also that a specific probability concepts test is comprehensible and readable by the great majority of students for whom it was devised. (FL)
Descriptors: Cloze Procedure, Elementary Secondary Education, Listening Skills, Probability
Peer reviewed
Dirkzwager, A. – Educational and Psychological Measurement, 1996
Testing with personal probabilities eliminates guessing, provided the subjects are well calibrated. A probability testing study with 47 Dutch elementary school children who used an interactive computer program shows that even 11-year-olds can estimate their personal probabilities correctly. (SLD)
Descriptors: Computer Assisted Testing, Elementary Education, Elementary School Students, Estimation (Mathematics)
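Probability testing of the kind described above is typically scored with a proper scoring rule, so that honestly reporting one's personal probability maximizes the expected score, and calibration can be checked by comparing stated probabilities with observed accuracy. The quadratic (Brier-type) rule and toy data below are illustrative assumptions, not details from Dirkzwager's program.

```python
def brier_score(stated_probs, correct_index):
    """Quadratic (Brier-type) score for one multiple-choice item: the
    examinee states a probability for each option; the rule is 'proper',
    so honest reporting maximizes the expected score."""
    return 1.0 - sum((p - (1.0 if i == correct_index else 0.0)) ** 2
                     for i, p in enumerate(stated_probs))

def calibration_table(pairs):
    """pairs: (stated probability of the chosen option, was it correct?).
    A well-calibrated examinee's observed accuracy tracks the stated
    probability within each bin."""
    bins = {}
    for p, ok in pairs:
        bins.setdefault(round(p, 1), []).append(ok)
    return {k: sum(v) / len(v) for k, v in sorted(bins.items())}

print(brier_score([0.7, 0.2, 0.1], correct_index=0))
# Toy calibration data: stated probability vs. actual correctness.
pairs = [(0.9, True), (0.9, True), (0.9, False), (0.5, True), (0.5, False)]
print(calibration_table(pairs))
```

Confident-and-wrong answers are penalized heavily (full certainty on a wrong option scores -1 here), which is what removes the incentive to guess.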