Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 5 |
Descriptor
| Testing Problems | 16 |
| Sampling | 15 |
| Evaluation Methods | 7 |
| Test Reliability | 6 |
| Equated Scores | 4 |
| Test Validity | 4 |
| Mathematical Models | 3 |
| Sample Size | 3 |
| Scoring | 3 |
| Test Bias | 3 |
| Testing Programs | 3 |
| More ▼ | |
Source
Author
| Altschuld, James W. | 1 |
| Askegaard, Lewis D. | 1 |
| Chen, Yuguo | 1 |
| Diao, Hongyu | 1 |
| Egbo, Anthonia Chinonyelum | 1 |
| Eiting, Mindert H. | 1 |
| Foster, Jeff L. | 1 |
| Gohmann, Stephen F. | 1 |
| Jaeger, Richard M. | 1 |
| Keller, Lisa | 1 |
| Kim, Sooyeon | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 16 |
| Reports - Research | 10 |
| Reports - Evaluative | 4 |
| Information Analyses | 2 |
| Opinion Papers | 2 |
| Guides - Non-Classroom | 1 |
| Reports - Descriptive | 1 |
Education Level
| Higher Education | 1 |
| Postsecondary Education | 1 |
| Secondary Education | 1 |
Audience
Location
| Nigeria | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| SAT (College Admission Test) | 2 |
| Stanford Binet Intelligence… | 1 |
What Works Clearinghouse Rating
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
Diao, Hongyu; Keller, Lisa – Applied Measurement in Education, 2020
Examinees who attempt the same test multiple times are often referred to as "repeaters." Previous studies suggested that repeaters should be excluded from the total sample before equating because repeater groups are distinguishable from non-repeater groups. In addition, repeaters might memorize anchor items, causing item drift under a…
Descriptors: Licensing Examinations (Professions), College Entrance Examinations, Repetition, Testing Problems
Egbo, Anthonia Chinonyelum – World Journal of Education, 2015
This study investigated the Counselling strategies for curbing "Examination Malpractices" in Secondary Schools in Enugu State Nigeria. The researcher used three research questions. The Design used was a descriptive survey design. Sample consisted of 335 respondents comprising principals (N = 19), PTA secretaries (N = 19), teachers (N =…
Descriptors: Counseling Techniques, Questionnaires, Foreign Countries, Surveys
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Chen, Yuguo; Small, Dylan – Psychometrika, 2005
Rasch proposed an exact conditional inference approach to testing his model but never implemented it because it involves the calculation of a complicated probability. This paper furthers Rasch's approach by (1) providing an efficient Monte Carlo methodology for accurately approximating the required probability and (2) illustrating the usefulness…
Descriptors: Testing Problems, Probability, Methods, Testing
Peer reviewedGohmann, Stephen F. – Journal of Educational Measurement, 1988
One method to correct for selection bias in comparing Scholastic Aptitude Test (SAT) scores among states is presented, which is a modification of J. J. Heckman's Selection Bias Correction (1976, 1979). Empirical results suggest that sample selection bias is present in SAT score regressions. (SLD)
Descriptors: Regression (Statistics), Sampling, Scoring, Selection
Peer reviewedAltschuld, James W.; And Others – Evaluation and Program Planning, 1992
An original study with a 96 percent questionnaire return rate and four replications with high return rates were compared in terms of populations sampled, implementation, and results. Future use of this procedure, which includes advance mailing, telephone contact, telephone explanations, and followups, is discussed. (SLD)
Descriptors: Comparative Analysis, Evaluation Methods, Mail Surveys, Questionnaires
Peer reviewedAskegaard, Lewis D.; Umila, Benwardo V. – Journal of Educational Measurement, 1982
Multiple matrix sampling of items and examinees was applied to an 18-item rank order instrument administered to a randomly assigned group and compared to the ordering and ranking of all items by control subjects. High correlations between ranks suggest the methodology may viably reduce respondent effort on long rank ordering tasks. (Author/CM)
Descriptors: Evaluation Methods, Item Sampling, Junior High Schools, Student Reaction
Peer reviewedWaddell, Deborah D. – Journal of School Psychology, 1980
A review of the technical data available on the 1972 norms edition of the Stanford-Binet demonstrates how inadequate these data are. The Stanford-Binet should not continue to be used in important decision making processes unless this weakness is corrected. (Author)
Descriptors: Educational Assessment, Elementary Secondary Education, Intelligence Quotient, Intelligence Tests
Peer reviewedEiting, Mindert H. – Applied Psychological Measurement, 1991
A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewedWainer, Howard – Journal of Educational Measurement, 1986
Describes recent research attempts to draw inferences about the relative standing of the states on the basis of mean SAT scores. This paper identifies five serious errors that call into question the validity of such inferences. Some plausible ways to avoid the errors are described. (Author/LMO)
Descriptors: College Entrance Examinations, Equated Scores, Mathematical Models, Predictor Variables
Peer reviewedJaeger, Richard M. – Educational Measurement: Issues and Practice, 1991
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
Peer reviewedStockman, Ida J. – Language, Speech, and Hearing Services in Schools, 1996
This article discusses the use of language sample analysis (LSA) as a screening tool for preschool linguistic minority children due to the difficulty of using standardized tests in assessing language delays in speakers of minority dialects and languages. The use of LSA with seven African American preschoolers is examined. (CR)
Descriptors: Black Students, Diagnostic Tests, Evaluation Methods, Language Minorities
Peer reviewedWilcox, Rand R. – Journal of Experimental Education, 1982
A closed sequential procedure for estimating true score is proposed for use with answer-until-correct tests. The accuracy of determining true score is the same as in conventional sequential solutions, but the possibility of using an unnecessarily large number of items is eliminated. (Author/CM)
Descriptors: Answer Sheets, Guessing (Tests), Item Banks, Measurement Techniques
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
