Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 12 |
Since 2006 (last 20 years) | 30 |
Descriptor
Statistical Analysis | 181 |
Testing Problems | 181 |
Test Reliability | 36 |
Test Validity | 35 |
Test Construction | 31 |
Scores | 28 |
Test Bias | 28 |
Test Interpretation | 27 |
Achievement Tests | 26 |
Foreign Countries | 25 |
Test Items | 25 |
More ▼ |
Source
Author
Sinharay, Sandip | 5 |
Frary, Robert B. | 3 |
Linn, Robert L. | 3 |
Barker, Pierce | 2 |
Bormuth, John R. | 2 |
Choi, Seung W. | 2 |
Echternacht, Gary | 2 |
Hambleton, Ronald K. | 2 |
Hurley, Christine | 2 |
Kelderman, Henk | 2 |
Kim, Dong-In | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 12 |
Postsecondary Education | 11 |
Secondary Education | 5 |
Adult Education | 2 |
Elementary Secondary Education | 2 |
Elementary Education | 1 |
High Schools | 1 |
Audience
Researchers | 12 |
Practitioners | 4 |
Teachers | 2 |
Location
Netherlands | 5 |
Iran | 4 |
Australia | 2 |
Germany | 2 |
United Kingdom | 2 |
Asia | 1 |
California (Stanford) | 1 |
Canada | 1 |
China | 1 |
Colorado (Denver) | 1 |
Costa Rica | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 2 |
Emergency School Aid Act 1972 | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Haberman, Shelby J.; Lee, Yi-Hsuan – ETS Research Report Series, 2017
In investigations of unusual testing behavior, a common question is whether a specific pattern of responses occurs unusually often within a group of examinees. In many current tests, modern communication techniques can permit quite large numbers of examinees to share keys, or common response patterns, to the entire test. To address this issue,…
Descriptors: Student Evaluation, Testing, Item Response Theory, Maximum Likelihood Statistics
Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015
With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis
Sinharay, Sandip; Duong, Minh Q.; Wood, Scott W. – Journal of Educational Measurement, 2017
As noted by Fremer and Olson, analysis of answer changes is often used to investigate testing irregularities because the analysis is readily performed and has proven its value in practice. Researchers such as Belov, Sinharay and Johnson, van der Linden and Jeon, van der Linden and Lewis, and Wollack, Cohen, and Eckerly have suggested several…
Descriptors: Identification, Statistics, Change, Tests
Harwell, Michael – Journal of Experimental Education, 2019
Measures of socioeconomic status (SES) are widely used in educational research and policy applications in no small part because of a deeply rooted belief of the importance of SES. This paper argues that the usefulness of common SES measures can be undermined by (a) an atheoretical approach to conceptualizing SES and selecting measures, which…
Descriptors: Socioeconomic Status, Measures (Individuals), Testing Problems, Educational Research
Sinharay, Sandip; Johnson, Matthew S. – Educational and Psychological Measurement, 2017
In a pioneering research article, Wollack and colleagues suggested the "erasure detection index" (EDI) to detect test tampering. The EDI can be used with or without a continuity correction and is assumed to follow the standard normal distribution under the null hypothesis of no test tampering. When used without a continuity correction,…
Descriptors: Deception, Identification, Testing Problems, Error of Measurement
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2017
An increasing concern of producers of educational assessments is fraudulent behavior during the assessment (van der Linden, 2009). Benefiting from item preknowledge (e.g., Eckerly, 2017; McLeod, Lewis, & Thissen, 2003) is one type of fraudulent behavior. This article suggests two new test statistics for detecting individuals who may have…
Descriptors: Test Items, Cheating, Testing Problems, Identification
Davis, Sara D.; Chan, Jason C. K. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2015
Retrieving studied materials often enhances subsequent learning of new materials (Pastötter & Bäuml, 2014). However, retrieval has also been shown to impair new learning (Finn & Roediger, 2013). In this article, we attempted to determine when retrieval enhances and when it impairs new learning. We argue that testing impairs new learning…
Descriptors: Recall (Psychology), Information Retrieval, Testing, Testing Problems
Kiley, Margaret; Holbrook, Allyson; Lovat, Terence; Fairbairn, Hedy; Starfield, Sue; Paltridge, Brian – Australian Universities' Review, 2018
While there has been considerable research on doctoral examination there is little that examines the various roles of the oral component and what issues one might consider if introducing or revising that aspect of the thesis examination process. This matter is of particular importance in Australia where it is not usual to have an oral component as…
Descriptors: Foreign Countries, Doctoral Dissertations, Evaluation Methods, Verbal Tests
Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014
With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)
Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017
Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language
Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015
This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…
Descriptors: Models, Engineering Education, Test Items, Outcome Measures
Guarino, Cassandra M.; Reckase, Mark D.; Stacy, Brian W.; Wooldridge, Jeffrey M. – Journal of Research on Educational Effectiveness, 2015
We study the properties of two specification tests that have been applied to a variety of estimators in the context of value-added measures (VAMs) of teacher and school quality: the Hausman test for choosing between student-level random and fixed effects, and a test for feedback (sometimes called a "falsification test"). We discuss…
Descriptors: Teacher Effectiveness, Educational Quality, Evaluation Methods, Tests
Unamma, Anthony Odera – Open Praxis, 2013
This research work was aimed at determining the degree of community members' interference in the conduct of university distance learning examination in South Eastern Nigeria. It was also aimed at finding out the factors responsible for the community members' interference, the ways by which interference is effected, the consequences and the…
Descriptors: Foreign Countries, Distance Education, Community Involvement, Testing Problems
Debeer, Dries; Janssen, Rianne; De Boeck, Paul – Journal of Educational Measurement, 2017
When dealing with missing responses, two types of omissions can be discerned: items can be skipped or not reached by the test taker. When the occurrence of these omissions is related to the proficiency process the missingness is nonignorable. The purpose of this article is to present a tree-based IRT framework for modeling responses and omissions…
Descriptors: Item Response Theory, Test Items, Responses, Testing Problems