Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 8 |
Descriptor
| Testing Problems | 37 |
| Sampling | 31 |
| Test Reliability | 10 |
| Equated Scores | 9 |
| Statistical Analysis | 9 |
| Test Construction | 9 |
| Test Items | 9 |
| Item Sampling | 8 |
| Achievement Tests | 7 |
| Elementary Secondary Education | 7 |
| Test Validity | 7 |
| More ▼ | |
Source
Author
| Wilcox, Rand R. | 2 |
| Angela Mazzone | 1 |
| Angoff, William H. | 1 |
| Askegaard, Lewis D. | 1 |
| Boser, Judith A. | 1 |
| Boyd, Thomas A. | 1 |
| Cahen, Leonard S. | 1 |
| Casserly, Michael | 1 |
| Chen, Yuguo | 1 |
| Conor Scully | 1 |
| Corcoran, Amanda | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 37 |
| Journal Articles | 10 |
| Speeches/Meeting Papers | 10 |
| Books | 1 |
| Guides - Non-Classroom | 1 |
| Information Analyses | 1 |
| Non-Print Media | 1 |
| Opinion Papers | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Elementary Secondary Education | 2 |
| Secondary Education | 2 |
| High Schools | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
| Researchers | 2 |
Location
| Ireland (Dublin) | 1 |
| Nigeria | 1 |
| United States | 1 |
| West Germany | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| Emergency School Aid Act 1972 | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Perkins Loan Program | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
Zita Lysaght; Michael O'Leary; Angela Mazzone; Conor Scully – Sage Research Methods Cases, 2022
Since 2018, colleagues from two research centers at Dublin City University have been collaborating to develop a measurement scale to assess individuals' ability to identify workplace bullying. Having agreed on an operational definition of the construct, an item pool of 26 workplace bullying scenarios, that is, short descriptions of…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Sánchez Sánchez, Ernesto; García Rios, Víctor N.; Silvestre Castro, Eleazar; Licea, Guadalupe Carrasco – North American Chapter of the International Group for the Psychology of Mathematics Education, 2020
In this paper, we address the following questions: What misconceptions do high school students exhibit in their first encounter with significance test problems through a repeated sampling approach? Which theory or framework could explain the presence and features of such patterns? With brief prior instruction on the use of Fathom software to…
Descriptors: High School Students, Misconceptions, Statistical Significance, Testing
Diao, Hongyu; Keller, Lisa – Applied Measurement in Education, 2020
Examinees who attempt the same test multiple times are often referred to as "repeaters." Previous studies suggested that repeaters should be excluded from the total sample before equating because repeater groups are distinguishable from non-repeater groups. In addition, repeaters might memorize anchor items, causing item drift under a…
Descriptors: Licensing Examinations (Professions), College Entrance Examinations, Repetition, Testing Problems
Egbo, Anthonia Chinonyelum – World Journal of Education, 2015
This study investigated the Counselling strategies for curbing "Examination Malpractices" in Secondary Schools in Enugu State Nigeria. The researcher used three research questions. The Design used was a descriptive survey design. Sample consisted of 335 respondents comprising principals (N = 19), PTA secretaries (N = 19), teachers (N =…
Descriptors: Counseling Techniques, Questionnaires, Foreign Countries, Surveys
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Wang, Shudong; Jiao, Hong; Jin, Ying; Thum, Yeow Meng – Online Submission, 2010
The vertical scales of large-scale achievement tests created by using item response theory (IRT) models are mostly based on cluster (or correlated) educational data in which students usually are clustered in certain groups or settings (classrooms or schools). While such application directly violated assumption of independent sample of person in…
Descriptors: Scaling, Achievement Tests, Data Analysis, Item Response Theory
Hart, Ray; Casserly, Michael; Uzzell, Renata; Palacios, Moses; Corcoran, Amanda; Spurgeon, Liz – Council of the Great City Schools, 2015
There has been little data collected on how much testing actually goes on in America's schools and how the results are used. So in the Spring of 2014, the Council staff developed and launched a survey of assessment practices. This report presents the findings from that survey and subsequent Council analysis and review of the data. It also offers…
Descriptors: Urban Schools, Student Evaluation, Testing Programs, Testing
Shoemaker, David M. – 1972
Investigated empirically through post mortem item-examinee sampling were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. The results indicate clearly that a partially balanced incomplete block…
Descriptors: Error of Measurement, Item Sampling, Matrices, Sampling
Chen, Yuguo; Small, Dylan – Psychometrika, 2005
Rasch proposed an exact conditional inference approach to testing his model but never implemented it because it involves the calculation of a complicated probability. This paper furthers Rasch's approach by (1) providing an efficient Monte Carlo methodology for accurately approximating the required probability and (2) illustrating the usefulness…
Descriptors: Testing Problems, Probability, Methods, Testing
National Education Association, Washington, DC. – 1975
The National Education Association's Task Force on Testing has stated its opinion that standardized tests are overused. The task force suggests that the application of sampling techniques and a variety of alternatives to current testing practices would accomplish the same purposes. Representatives of the testing industry have indicated that the…
Descriptors: Accountability, Alternative Assessment, Cost Effectiveness, Educational Testing
Hill, Richard K. – 1975
This study is an a priori demonstration of the applicability of multiple matrix sampling techniques to the practical research problem of parameter estimation. Three tests were administered to two separate but parallel populations, with one receiving item samples and the other receiving full tests. Special efforts were made to minimize the context…
Descriptors: Bias, Item Sampling, Matrices, Standardized Tests
Hoover, Randy L.; Kadunc, Nancy – 1983
The purpose of this paper is to examine the nature of discrepancy score phenomena of the Myers-Briggs Type Indicator (MBTI), as related to internal consistency and construct validity of the instrument. Data were collected from 140 university research managers. The data suggest internal consistency problems: only 37.3 percent of the subjects…
Descriptors: Adults, Personality Measures, Personality Traits, Sampling
Lord, Frederic M. – 1981
Transformations or equating of raw test scores on two or more forms of the same test are made interchangeable by empirical procedures deriving the standard error of an equipercentile equating for four different situations. Some numerical results are checked by Monte Carlo methods. Numerical standard errors are computed for two sets of real data.…
Descriptors: Educational Testing, Equated Scores, Error of Measurement, Mathematical Formulas
PDF pending restorationReckase, Mark D. – 1979
Because latent trait models require that large numbers of items be calibrated or that testing of the same large group be repeated, item parameter estimates are often obtained by administering separate tests to different groups and "linking" the results to construct an adequate item pool. Four issues were studied, based upon the analysis…
Descriptors: Achievement Tests, High Schools, Item Banks, Mathematical Models

Peer reviewed
Direct link
