ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	2

Descriptor

Probability	6
Statistical Analysis	6
Testing Problems	6
Achievement Tests	2
Cheating	2
Evaluation Methods	2
Item Response Theory	2
Mathematical Models	2
Bayesian Statistics	1
Comparative Analysis	1
Computer Assisted Testing	1
Correlation	1
English (Second Language)	1
Error of Measurement	1
Evaluation Criteria	1
Hypothesis Testing	1
Identification	1
Language Tests	1
Mathematical Formulas	1
Maximum Likelihood Statistics	1
Multiple Choice Tests	1
Regression (Statistics)	1
Research Problems	1
Responses	1
Sample Size	1
More ▼

Source

ETS Research Report Series	1
Educational and Psychological…	1
Journal of Educational…	1

Author

Wilcox, Rand R.	2
Choi, Seung W.	1
Echternacht, Gary	1
Haberman, Shelby J.	1
Kim, Dong-In	1
Lee, Yi-Hsuan	1
Madsen, Harold S.	1
Sinharay, Sandip	1
Wan, Ping	1
Whitaker, Mike	1
Zhang, Litong	1
More ▼

Publication Type

Reports - Research	5
Journal Articles	2
Collected Works - General	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Indiana Statewide Testing for…

What Works Clearinghouse Rating

Showing all 6 results Save | Export

A Statistical Procedure for Testing Unusually Frequent Exactly Matching Responses and Nearly Matching Responses. Research Report. ETS RR-17-23

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Lee, Yi-Hsuan – ETS Research Report Series, 2017

In investigations of unusual testing behavior, a common question is whether a specific pattern of responses occurs unusually often within a group of examinees. In many current tests, modern communication techniques can permit quite large numbers of examinees to share keys, or common response patterns, to the entire test. To address this issue,…

Descriptors: Student Evaluation, Testing, Item Response Theory, Maximum Likelihood Statistics

Determining the Overall Impact of Interruptions during Online Testing

Peer reviewed

Direct link

Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014

With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…

Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)

A Quick Method for Determining Test Bias

Peer reviewed

Echternacht, Gary – Educational and Psychological Measurement, 1974

Descriptors: Evaluation Criteria, Probability, Statistical Analysis, Test Bias

A Two-Stage Procedure for Selecting the Best of Several Binomial Populations; [and] Some Exact Sample Sizes for Comparing the Squared Multiple Correlation Coefficient to a Standard; [and] An Improved Decision-Theoretic Coefficient for Tests. Studies in Measurement and Methodology, Work Unit 3: Technical Adequacy of Tests.

Wilcox, Rand R. – 1979

Three separate papers are included in this report. The first describes a two-stage procedure for choosing from among several instructional programs the one which maximizes the probability of passing the test. The second gives the exact sample sizes required to determine whether a squared multiple correlation coefficient is above or below a known…

Descriptors: Bayesian Statistics, Correlation, Hypothesis Testing, Mathematical Models

An Alternative Interpretation of Three Stability Models. Measurement and Methodology, Work Unit 2: Technical Adequacy of Tests.

Wilcox, Rand R. – 1978

Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…

Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models

Utilizing Rasch Analysis to Detect Cheating on Language Examinations.

Madsen, Harold S. – 1987

A study investigated the effectiveness of the Rasch procedure in measuring response appropriateness, especially for the detection of cheating on multiple-choice language tests. The report gives background information on appropriateness measurement and its potential uses, reviews recent research on cheating and its detection, and describes three…

Descriptors: Cheating, English (Second Language), Evaluation Methods, Language Tests