Publication Date
| In 2026 | 0 |
| Since 2025 | 27 |
| Since 2022 (last 5 years) | 113 |
| Since 2017 (last 10 years) | 280 |
| Since 2007 (last 20 years) | 517 |
Descriptor
| Testing Problems | 4850 |
| Elementary Secondary Education | 1262 |
| Test Validity | 1008 |
| Test Construction | 801 |
| Standardized Tests | 790 |
| Higher Education | 658 |
| Test Reliability | 607 |
| Student Evaluation | 583 |
| Testing | 564 |
| Test Bias | 562 |
| Achievement Tests | 555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 29 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Peer reviewedvan den Wollenberg, Arnold L. – Psychometrika, 1982
Presently available test statistics for the Rasch model are shown to be insensitive to violations of the assumption of test unidimensionality. Two new statistics are presented. One is similar to available statistics, but with some improvements; the other addresses the problem of insensitivity to unidimensionality. (Author/JKS)
Descriptors: Item Analysis, Latent Trait Theory, Statistics, Test Reliability
Peer reviewedStreiner, David L.; Miller, Harold R. – Journal of Consulting and Clinical Psychology, 1979
A table is provided and described for prorating Minnesota Multiphasic Personality Inventory scales when the entire Form R has not been completed. Good concordance of profile types was found for 300 and 350 completed questions. Interpretations based on 200 items may be suspect. (Author)
Descriptors: Item Analysis, Patients, Personality Assessment, Personality Measures
Simeonsson, Rune J.; And Others – Journal of the Association for the Severely Handicapped (JASH), 1980
The article reviews a number of problems in the assessment of young children with severe handicaps and suggests a number of steps to improve the assessment process. (Author/DLS)
Descriptors: Clinical Diagnosis, Evaluation Methods, Severe Disabilities, Testing Problems
Peer reviewedBrennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980
Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)
Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability
Aiken, Lewis R. – New Directions for Testing and Measurement, 1980
A comprehensive overview of the beginnings of attitude measurement is merged with a discussion of recent developments. A synthesis of new research on technical issues related to the reliability and validity of attitude measures and contemporary views on attitude formation and change are presented. (Author)
Descriptors: Attitude Change, Attitude Measures, Test Reliability, Test Validity
Peer reviewedBachor, Dan G. – Journal of Special Education, 1979
Tests used to estimate mental abilities (measures of intelligence, perceptual motor ability, and early identification of learning disabilities) are critically examined. (Author)
Descriptors: Adolescents, Disabilities, Intelligence Tests, Test Reviews
Peer reviewedCalkins, Lucy; Montgomery, Kate; Santman, Donna – Practical Assessment, Research & Evaluation, 1999
Describes common mistakes made by young children taking standardized tests and suggests several teaching strategies that may be useful to teachers who are preparing a class to take standardized tests. Teachers need to be sure they don't add to the pressure of standardized testing by overreacting to small deeds of misbehavior or emphasizing the…
Descriptors: Children, Elementary Secondary Education, Standardized Tests, Test Use
Peer reviewedFoxcroft, Cheryl D. – International Journal of Testing, 2001
Considers ways to implement the International Guidelines for Test Use (International Test Commission, 2001) to maximize their intended impact . The process calls for customizing the guidelines for specific assessment contexts and needs and using the guidelines to generate competency standards. (SLD)
Descriptors: Competence, Educational Assessment, International Education, Standards
Peer reviewedRoth, Jay – Journal of Reading, Writing, and Learning Disabilities International, 1989
The article considers issues of objectivity in regard to evaluating a child's performance on a test. Quantum theory is invoked to illustrate the close relationship of the child's "score" to the specific conditions of the evaluation experience. Educators are encouraged to remember that the score cannot be isolated from the testing experience. (DB)
Descriptors: Educational Philosophy, Elementary Secondary Education, Scores, Test Interpretation
Peer reviewedCarwile, Nancy R. – Educational Leadership, 1990
Presents a facetious, ingenious resolution to the percentile dilemma concerning above- and below-average test scores. If schools enrolled the same number of pigs as students and tested both groups, the pigs would fill up the bottom half and all children would rank in the top 50 percent. However, some wrinkles need to be ironed out! (MLH)
Descriptors: Elementary Secondary Education, Humor, Percentage, Scores
Taylor, Ronald L. – Diagnostique, 1988
Though curriculum-based assessment (CBA) is considered a valuable evaluation procedure, areas of concern exist, including: assumptions regarding content of the CBA instrument, appropriateness of CBA as the only information source for individualized education programs, and recognition of CBA's limitations as a procedure/model under the current…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Student Evaluation
Peer reviewedWalsh, Patricia C.; And Others – Psychology in the Schools, 1989
Examined ability of Woodcock-Johnson Psycho-Educational Battery (WJPEB) to identify learning-disabled (LD) students. Administered WJPEB to 71 previously identified LD students and evaluated cluster score performance. Used three methods of obtaining discrepancies; slightly more than one-half of LD students were identified. Memory deficits were…
Descriptors: Educational Diagnosis, Elementary Education, Learning Disabilities, Psychoeducational Methods
Peer reviewedOshima, T. C. – Journal of Educational Measurement, 1994
The effect of violating the assumption of nonspeededness on ability and item parameter estimates in item response theory was studied through simulation under three speededness conditions. Results indicate that ability estimation was least affected by speededness but that substantial effects on item parameter estimates were found. (SLD)
Descriptors: Ability, Computer Simulation, Estimation (Mathematics), Item Response Theory
Peer reviewedMann, Jim; And Others – Journal of Offender Rehabilitation, 1992
Examined Minnesota Multiphasic Personality Inventory-2 (MMPI-2) profiles of incarcerated pedophiles in state prisons (n=60), federal prisons (n=24), and military confinement facilities (n=25), each offering different educational and social composition. Multivariate statistics revealed that three groups' profiles were significantly different,…
Descriptors: Child Abuse, Criminals, Personality Assessment, Prisoners
Peer reviewedAlderson, J. Charles; Wall, Dianne – Applied Linguistics, 1993
The notion of washback, that testing influences teaching, is explored and a series of possible hypotheses are advanced. The empirical research in general education and in language education is reviewed to determine whether washback actually exists, how it can be measured, and what accounts for its form. Proposals for future research are suggested.…
Descriptors: Foreign Countries, Language Tests, Teaching Methods, Test Coaching


