Showing 1 to 15 of 29 results
Peer reviewed
von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023
Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…
Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit
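As a point of reference for the kind of DIF screening discussed here, the classical Mantel-Haenszel procedure, whose significance test relies on a limiting chi-square distribution of the sort the abstract mentions, can be sketched in a few lines; this is a standard method, not the robust approach proposed in the article, and the responses, group labels, and matching scores below are simulated.

import numpy as np

def mantel_haenszel_dif(resp, group, total):
    # Classical Mantel-Haenszel common odds ratio for one studied item.
    # resp: 0/1 item responses, group: 0 = reference / 1 = focal,
    # total: matching variable (e.g., total test score) used for stratification.
    resp, group, total = map(np.asarray, (resp, group, total))
    num = den = 0.0
    for k in np.unique(total):                        # one 2x2 table per score level
        m = total == k
        a = np.sum((group[m] == 0) & (resp[m] == 1))  # reference, correct
        b = np.sum((group[m] == 0) & (resp[m] == 0))  # reference, incorrect
        c = np.sum((group[m] == 1) & (resp[m] == 1))  # focal, correct
        d = np.sum((group[m] == 1) & (resp[m] == 0))  # focal, incorrect
        t = a + b + c + d
        if t > 0:
            num += a * d / t
            den += b * c / t
    alpha_mh = num / den                              # common odds ratio
    return alpha_mh, -2.35 * np.log(alpha_mh)         # MH D-DIF on the ETS delta scale

# Illustrative call on simulated data.
rng = np.random.default_rng(0)
group = rng.integers(0, 2, 500)
total = rng.integers(0, 21, 500)
resp = rng.binomial(1, 0.3 + 0.02 * total)
print(mantel_haenszel_dif(resp, group, total))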
Peer reviewed
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019
Lord's (1980) equity theorem claims that observed-score equating is possible only when two test forms are either perfectly reliable or strictly parallel. An analysis of its proof reveals the use of an incorrect statistical assumption. The assumption does not invalidate the theorem itself, though, which can be shown to follow directly from the discrete nature of…
Descriptors: Equated Scores, Testing Problems, Item Response Theory, Evaluation Methods
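For reference, Lord's equity requirement is usually stated as follows (a standard textbook formulation, not a quotation from the article): an equating transformation \varphi carrying scores on form Y to the scale of form X should satisfy

\Pr\{\varphi(Y) \le x \mid \theta\} = \Pr\{X \le x \mid \theta\} \quad \text{for every score } x \text{ and every ability } \theta,

that is, equated and original scores should have identical conditional distributions at every ability level; the theorem concerns when this can hold exactly.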
Peer reviewed
Sánchez Sánchez, Ernesto; García Rios, Víctor N.; Silvestre Castro, Eleazar; Licea, Guadalupe Carrasco – North American Chapter of the International Group for the Psychology of Mathematics Education, 2020
In this paper, we address the following questions: What misconceptions do high school students exhibit in their first encounter with significance test problems through a repeated sampling approach? Which theory or framework could explain the presence and features of such patterns? With brief prior instruction on the use of Fathom software to…
Descriptors: High School Students, Misconceptions, Statistical Significance, Testing
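The repeated-sampling approach to significance testing that the study describes (implemented there in Fathom) can be illustrated with a short simulation; the null proportion, sample size, and observed count below are hypothetical and are not taken from the study.

import numpy as np

# Simulation-based (repeated sampling) significance test for a proportion.
# H0: p = 0.5; suppose 36 successes were observed in 50 trials (hypothetical data).
rng = np.random.default_rng(1)
p_null, n, observed = 0.5, 50, 36

# Draw many samples under H0 and record the simulated success counts.
simulated = rng.binomial(n, p_null, size=10_000)

# One-sided p-value: how often does chance alone match or exceed the observation?
p_value = np.mean(simulated >= observed)
print(f"estimated p-value: {p_value:.4f}")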
Peer reviewed
Sinharay, Sandip; Duong, Minh Q.; Wood, Scott W. – Journal of Educational Measurement, 2017
As noted by Fremer and Olson, analysis of answer changes is often used to investigate testing irregularities because the analysis is readily performed and has proven its value in practice. Researchers such as Belov, Sinharay and Johnson, van der Linden and Jeon, van der Linden and Lewis, and Wollack, Cohen, and Eckerly have suggested several…
Descriptors: Identification, Statistics, Change, Tests
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2018
Wollack, Cohen, and Eckerly suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This article suggests two modifications of the EDI for…
Descriptors: Deception, Identification, Testing Problems, Cheating
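Without reproducing the exact EDI formula, the flavor of such indices can be conveyed by a standardized wrong-to-right (WTR) erasure count; the item-level WTR probabilities are assumed to come from some fitted model, and the function below is a generic sketch, not the statistic proposed in the article.

import numpy as np

def standardized_wtr_count(observed_wtr, p_wtr):
    # Standardize an observed wrong-to-right erasure count against
    # model-implied item probabilities (generic sketch, not the exact EDI).
    p = np.asarray(p_wtr, dtype=float)
    expected = p.sum()                 # mean of a sum of independent Bernoullis
    variance = np.sum(p * (1 - p))     # corresponding variance
    # Continuity-corrected z statistic; large positive values are suspicious.
    return (observed_wtr - expected - 0.5) / np.sqrt(variance)

# Illustrative call: 9 WTR erasures on a 40-item test with a 5% WTR probability per item.
print(standardized_wtr_count(9, np.full(40, 0.05)))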
Peer reviewed
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2017
An increasing concern of producers of educational assessments is fraudulent behavior during the assessment (van der Linden, 2009). Benefiting from item preknowledge (e.g., Eckerly, 2017; McLeod, Lewis, & Thissen, 2003) is one type of fraudulent behavior. This article suggests two new test statistics for detecting individuals who may have…
Descriptors: Test Items, Cheating, Testing Problems, Identification
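The article's two statistics are not reproduced here, but the underlying idea of contrasting performance on possibly compromised items with performance on secure items can be sketched generically; the item sets and responses below are hypothetical, and the z contrast is a plain two-proportion statistic rather than anything from the article.

import numpy as np

def compromised_vs_secure_z(resp, compromised):
    # Two-proportion z contrast between possibly compromised and secure items
    # for one examinee (a generic sketch, not the article's statistics).
    resp = np.asarray(resp, dtype=float)
    mask = np.asarray(compromised, dtype=bool)
    p1, n1 = resp[mask].mean(), mask.sum()        # proportion correct, flagged items
    p0, n0 = resp[~mask].mean(), (~mask).sum()    # proportion correct, secure items
    pooled = resp.mean()
    se = np.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n0))
    return (p1 - p0) / se                         # large positive values suggest preknowledge

# Illustrative call: 20 flagged items answered very well, 40 secure items near chance.
resp = np.r_[np.ones(18), np.zeros(2), np.ones(20), np.zeros(20)]
flagged = np.r_[np.ones(20, dtype=bool), np.zeros(40, dtype=bool)]
print(compromised_vs_secure_z(resp, flagged))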
Sinharay, Sandip – Grantee Submission, 2017
Wollack, Cohen, and Eckerly (2015) suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly (2017) extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This paper suggests two modifications of…
Descriptors: Deception, Identification, Testing Problems, Cheating
Peer reviewed
Klufa, Jindrich – Journal on Efficiency and Responsibility in Education and Science, 2016
The paper analyzes the differences in the number of points scored on the mathematics test across the test variants used in the entrance examinations at the Faculty of Business Administration at the University of Economics in Prague in 2015. The differences may arise from the varying difficulty of the variants for students, but also…
Descriptors: Foreign Countries, College Students, Business Administration Education, College Entrance Examinations
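A comparison of point totals across test variants of the kind described here is commonly carried out with a one-way ANOVA and a nonparametric counterpart; the per-variant scores below are simulated and are not the Prague data, and the tests shown are standard ones, not necessarily those used in the paper.

import numpy as np
from scipy import stats

# Simulated point totals for three hypothetical test variants.
rng = np.random.default_rng(2)
variant_a = rng.normal(60, 12, 200)
variant_b = rng.normal(58, 12, 200)
variant_c = rng.normal(63, 12, 200)

# Parametric check: do the variant means differ?
f_stat, p_anova = stats.f_oneway(variant_a, variant_b, variant_c)

# Nonparametric check on the whole distributions.
h_stat, p_kw = stats.kruskal(variant_a, variant_b, variant_c)

print(f"ANOVA: F = {f_stat:.2f}, p = {p_anova:.4f}")
print(f"Kruskal-Wallis: H = {h_stat:.2f}, p = {p_kw:.4f}")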
Feinberg, Richard A. – ProQuest LLC, 2012
Subscores, also known as domain scores, diagnostic scores, or trait scores, can help determine test-takers' relative strengths and weaknesses and appropriately focus remediation. However, subscores often have poor psychometric properties, particularly reliability and distinctiveness (Folske, Gessaroli, & Swanson, 1999; Monaghan, 2006;…
Descriptors: Simulation, Tests, Testing, Scores
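Two of the properties named here, reliability and distinctiveness, are often screened with coefficient alpha and disattenuated subscore correlations; the following minimal sketch uses simulated item-score matrices, and the helper names are ours.

import numpy as np

def cronbach_alpha(items):
    # Coefficient alpha for an examinee-by-item score matrix.
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_var = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_var / total_var)

def disattenuated_corr(sub1, sub2, rel1, rel2):
    # Subscore correlation corrected for unreliability;
    # values near 1 suggest the subscores are not distinct.
    r = np.corrcoef(sub1, sub2)[0, 1]
    return r / np.sqrt(rel1 * rel2)

# Illustrative call: two 10-item subdomains driven by the same ability, 300 examinees.
rng = np.random.default_rng(3)
theta = rng.normal(size=300)
prob = 1 / (1 + np.exp(-theta[:, None]))
domain1 = (rng.random((300, 10)) < prob).astype(int)
domain2 = (rng.random((300, 10)) < prob).astype(int)

a1, a2 = cronbach_alpha(domain1), cronbach_alpha(domain2)
s1, s2 = domain1.sum(axis=1), domain2.sum(axis=1)
print(a1, a2, disattenuated_corr(s1, s2, a1, a2))

Because both simulated domains are driven by the same ability, the disattenuated correlation should come out near 1, i.e., the subscores carry little distinct information.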
Peer reviewed
Hanson, Bradley A. – Applied Measurement in Education, 1996
Whether score distributions differ on two or more test forms administered to samples of examinees from a single population is explored using three statistical tests based on loglinear models. Examples are presented of applying these tests of distribution differences to decide whether equating is needed for alternative forms of a test. (SLD)
Descriptors: Equated Scores, Scoring, Statistical Distributions, Test Format
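One of the simpler checks in this spirit is a likelihood-ratio (G²) test of independence between test form and score category, which corresponds to a loglinear model with no form-by-score association; the contingency table below is hypothetical, and the test is a generic one, not necessarily one of the three examined in the article.

import numpy as np
from scipy.stats import chi2_contingency

# Hypothetical form-by-score-category counts
# (rows: forms X and Y; columns: low / middle / high score ranges).
table = np.array([[120, 310, 170],
                  [140, 295, 165]])

# G^2 likelihood-ratio test of the loglinear independence model.
g2, p, dof, expected = chi2_contingency(table, lambda_="log-likelihood")
print(f"G^2 = {g2:.2f}, df = {dof}, p = {p:.3f}")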
Stout, William – 1984
An important problem in psychological test theory is the development of a sound method for determining whether a test which purports to measure the level of a certain ability is, in reality, significantly contaminated by one or more other abilities displayed by persons taking the test. Because of the large number of private and governmental…
Descriptors: Latent Trait Theory, Statistical Analysis, Statistical Distributions, Test Validity
Peer reviewed
Burket, George R. – Journal of Educational Measurement, 1987
This response to the Baglin paper (1986) points out the fallacy in inferring that inappropriate scaling procedures cause apparent discrepancies between medians and means and between means calculated using different units. (LMO)
Descriptors: Norm Referenced Tests, Scaling, Scoring, Statistical Distributions
Peer reviewed
Walberg, Herbert J.; And Others – Review of Educational Research, 1984
This paper demonstrates the variety of positive-skew phenomena and discusses their theoretical, research, and practical implications in education. (PN)
Descriptors: Academic Achievement, Data Analysis, Research Problems, Scores
Peer reviewed
Roberts, Dennis M. – Journal of Educational Measurement, 1987
This study examines a score-difference model for the detection of cheating based on the difference between two scores for an examinee: one obtained with the appropriate scoring key and another with an alternative, inappropriate key. It argues that the score-difference method could falsely accuse students of cheating. (Author/JAZ)
Descriptors: Answer Keys, Cheating, Mathematical Models, Multiple Choice Tests
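The score-difference idea under discussion can be sketched as follows: each examinee is scored once with the operational key and once with an alternative, inappropriate key, and the difference between the two scores is examined; the keys and responses below are hypothetical.

import numpy as np

def score_difference(responses, correct_key, alternative_key):
    # Score each examinee with the correct key and with an alternative,
    # inappropriate key, and return the per-examinee difference.
    responses = np.asarray(responses)
    score_correct = (responses == np.asarray(correct_key)).sum(axis=1)
    score_alternative = (responses == np.asarray(alternative_key)).sum(axis=1)
    return score_correct - score_alternative

# Illustrative call: 3 examinees, 5 multiple-choice items (options A-D).
responses = np.array([list("ABCDA"), list("ABCCA"), list("DBCCB")])
correct_key = list("ABCDA")
alternative_key = list("ABCCB")   # e.g., a key an examinee might have copied from
print(score_difference(responses, correct_key, alternative_key))

A strongly negative difference (scoring better on the inappropriate key than on the correct one) is what such a model would flag; the article's point is that honest examinees can also produce such patterns.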
Peer reviewed
Huberty, Carl J. – Educational Researcher, 1987
Two approaches to statistical testing are critically reviewed. A new approach, which is a hybrid of the two, is proposed. The new approach requires the researcher to think about the two types of potential inferential errors and about an explicit alternative hypothesis of interest. (VM)
Descriptors: Educational Assessment, Instruction, Multivariate Analysis, Researchers
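The explicit attention to both types of inferential error that this hybrid approach calls for can be illustrated with a simple power computation for a two-sided, two-sample z-test; the alpha level, standardized effect size, and per-group sample size below are hypothetical design choices.

from scipy.stats import norm

# Hypothetical design choices: Type I error risk (alpha), a specific
# alternative (standardized effect size d), and the per-group sample size n.
alpha, d, n = 0.05, 0.4, 64

# Two-sided two-sample z-test: power = P(reject H0 | the alternative d holds).
z_crit = norm.ppf(1 - alpha / 2)
noncentrality = d * (n / 2) ** 0.5          # d * sqrt(n / 2) for equal group sizes
power = 1 - norm.cdf(z_crit - noncentrality) + norm.cdf(-z_crit - noncentrality)
beta = 1 - power                            # Type II error risk at this alternative

print(f"power = {power:.3f}, Type II error risk = {beta:.3f}")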