Peer reviewed: Schultz, Linda J.; Fortune, Jim C. – Education, 1981
Illustrates how primary sources of test bias (instrument bias, item bias, and interpretation bias) can occur in a testing situation. Discusses how biases can be accounted for in testing decisions. Offers rationale for proper utilization of tests and makes a case for the renewal of test utilization in the classroom. (Author/NEC)
Descriptors: Elementary Secondary Education, Standardized Tests, Test Bias, Test Interpretation
Peer reviewed: Donlon, Thomas F. – Journal of Educational Measurement, 1981
Scores within the chance range are differentiated, "uninterpretable" scores being those that demonstrate randomness (broadly defined) by failing to achieve typical levels of correlation with group-determined difficulty. The relevant literature is reviewed. Finally, randomness and uninterpretability are examined in light of the…
Descriptors: Difficulty Level, Guessing (Tests), Multiple Choice Tests, Scores
Peer reviewed: Andrich, David – Psychometrika, 1978
A rating response mechanism for ordered categories such as in Likert scaling, which is related to the traditional threshold formulation but distinctively different from it, is formulated. The mechanism is based on the Rasch model. Two parameters in addition to the usual Rasch parameters are identified and discussed. (Author/JKS)
Descriptors: Item Analysis, Mathematical Models, Psychometrics, Rating Scales
Peer reviewed: Loucks, Sandra; And Others – Journal of School Psychology, 1980
Mexican-American participants in a summer career-exploration program were given the Kent Emergency Scale and the Otis-Lennon Test of Mental Abilities. Correlation between Otis IQ and Kent raw score was significantly positive but lower than those reported in other settings. Caution is warranted in interpreting Kent results in Mexican-Americans.…
Descriptors: Adolescents, Career Exploration, Intelligence Tests, Mexican Americans
Peer reviewed: Ebel, Robert L. – NASSP Bulletin, 1979
The author concludes that testing in basic education is indispensable. Several objections to external testing are discussed. (Author/MLF)
Descriptors: Basic Skills, Competency Based Education, Opinions, Secondary Education
Peer reviewed: Green, J. R.; And Others – British Journal of Educational Psychology, 1981
A simple unbalanced block model is proposed for examination marks, as an improvement on the usual implicit model. The new model is applied to some real data and is found, by the usual normal linear theory F test, to give a highly significant improvement. Some alternative models are also considered. (Author)
Descriptors: Achievement Rating, Achievement Tests, Models, Scoring Formulas
Peer reviewed: Wilcox, Rand R. – Psychometrika, 1979
When comparing examinees to a control group or person, the examiner usually does not know the probability of correct classification based on the number of items used and the number of people tested. Using ranking and selection techniques, a framework is described for deriving a lower bound on this probability. (Author/JKS)
Descriptors: Criterion Referenced Tests, Cutting Scores, Probability, Psychometrics
Peer reviewed: Brown, Ric – Journal for Research in Mathematics Education, 1980
The author discusses the importance of statistical significance to researchers and suggests that researchers should consider an additional statistic, the magnitude of effect index. (MK)
Descriptors: Educational Research, Mathematics Education, Research Problems, Researchers
Peer reviewed: Myerberg, N. James – Educational and Psychological Measurement, 1979
The effect of stratified sampling of items based on item difficulty and/or interitem correlations on the estimation of test score distribution parameters using multiple matrix sampling was studied. Results indicated that stratification did not consistently improve the stability of parameter estimation. (Author/JKS)
Descriptors: Item Analysis, Item Sampling, Matrices, Technical Reports
Peer reviewed: Hafner, James L.; And Others – Journal of Clinical Psychology, 1979
A WAIS short form, consisting of the Similarities, Picture Arrangement, and Block Design subtests, was administered to 109 undergraduates. Correlation between these scores and their Full Scale WAIS IQ scores was .90. The subtests underestimated IQ by 9.29 points, suggesting that the constant be adjusted for this population. (SJL)
Descriptors: College Students, Correlation, Intelligence Quotient, Intelligence Tests
Peer reviewed: Yaney, Joseph P. – Performance Improvement, 1997
Offers suggestions for designing a management questionnaire and interpreting employee responses so that executives may make an informed decision on whether to support an intervention. Highlights include employee perceptions on competing goals; supervisory suggestions and employee reactions; and a case study. (Author/LRW)
Descriptors: Case Studies, Employee Attitudes, Questionnaires, Supervision
Peer reviewed: Holaday, Margot – Assessment, 1996
A survey of 26 Rorschach experts and 19 students of Rorschach use was conducted to help students using the Exner Comprehensive System determine whether to code movement for nouns with definitions that include movement. Experts and students did not reach agreement, but a literature review suggests such nouns should often be coded as movement. (SLD)
Descriptors: Coding, Definitions, Motion, Nouns
Peer reviewed: Green, Donald Ross; Trimble, C. Scott; Lewis, Daniel M. – Educational Measurement: Issues and Practice, 2003
Describes the procedures by which Kentucky's state assessment program synthesized results from three standard setting procedures (Contrasting Groups, Bookmark, and Jaeger-Mills) for the 2000 state assessment. Shows the value of using multiple standard-setting approaches to gather information from each. (SLD)
Descriptors: Achievement Tests, Standard Setting, State Programs, Synthesis
Peer reviewed: Bachman, Lyle F. – Educational Measurement: Issues and Practice, 2002
Describes an approach to addressing issues of validity of inferences and the extrapolation of inferences to target domains beyond the assessment for alternative assessments. Makes the case that in both language testing and educational assessment the roles of language and content knowledge must be considered, and that the design and development of…
Descriptors: Alternative Assessment, Educational Assessment, Inferences, Performance Based Assessment
Peer reviewed: McKee, Lynne M.; Levinson, Edward M. – Career Development Quarterly, 1990
Discusses general issues and concerns relative to the adaptation of paper-pencil assessment instruments to computerized formats. Describes and evaluates Self-Directed Search computerized version (SDS-CV). Presents strengths and weaknesses of the SDS-CV and makes recommendations for its use. (Author/ABL)
Descriptors: Career Counseling, Computer Oriented Programs, Evaluation Methods, Reliability


