Morgan, Anne; Wainer, Howard – Journal of Educational Statistics, 1980 (peer reviewed)
Two estimation procedures for the Rasch Model of test analysis are reviewed in detail, particularly with respect to new developments that make the more statistically rigorous conditional maximum likelihood estimation practical for use with longish tests. (Author/JKS)
Descriptors: Error of Measurement, Latent Trait Theory, Maximum Likelihood Statistics, Psychometrics
Stamm, Carol Lee; Moore, Joyce E. – Research Quarterly, 1980
Generalizability theory provides the teacher and the researcher with a flexible method for establishing reliability coefficients in tests. This theory is effective in estimating reliability for a set of motor performance test scores. (CJ)
Descriptors: Educational Research, Evaluation Methods, Motor Development, Performance Tests
Good, Frances – Educational Studies, 1989 (peer reviewed)
Considers issues surrounding the use of differentiated examinations. Discusses how differentiation may be provided, the wording of questions, and how marks should be given. Highlights some pitfalls of using this approach. Concludes that, although differentiated examinations are possible, they will not always meet the needs of the end range of test…
Descriptors: Educational Research, Elementary Secondary Education, Evaluation, Foreign Countries
Mitchell, James V., Jr. – Applied Measurement in Education, 1988 (peer reviewed)
Applications of Oscar K. Buros' values and convictions to current developments in measurement are considered. Biographical information and Buros' personal philosophy on applied measurement are discussed. The Buros tradition refocuses evaluators' attention on the implications of their work for the end users of measurement results--test users and…
Descriptors: Computer Assisted Testing, Educational Assessment, Educational Philosophy, Educational Researchers
Balch, William R. – Teaching of Psychology, 1989 (peer reviewed)
Studies the effect of item order on test scores and completion time. Students scored slightly higher when test items were grouped sequentially (relating to text and lectures) than when items were grouped by text chapter but ordered randomly, or when items were ordered randomly. Found no differences in completion time. (Author/LS)
Descriptors: Educational Research, Higher Education, Performance, Psychology
Hui, C. Harry; Triandis, Harry C. – Journal of Cross-Cultural Psychology, 1989 (peer reviewed)
Examines the question of whether cultural and ethnic groups differ in their extreme response style. Studies questionnaire responses of Hispanic and non-Hispanic male Navy recruits and suggests that differences in extreme response style may be attributable to differences in judgment style across the two cultural groups. (MW)
Descriptors: Cross Cultural Studies, Cultural Differences, Hispanic Americans, Males
Henning, Grant – Language Testing, 1988 (peer reviewed)
Violations of item unidimensionality on language tests produced distorted estimates of person ability, and violations of person unidimensionality produced distorted estimates of item difficulty. The Bejar Method was sensitive to such distortions. (Author)
Descriptors: Construct Validity, Content Validity, Difficulty Level, Item Analysis
Bruno, James E.; Dirkzwager, A. – Educational and Psychological Measurement, 1995 (peer reviewed)
Determining the optimal number of choices on a multiple-choice test is explored analytically from an information theory perspective. The analysis revealed that, in general, three choices seem optimal. This finding is in agreement with previous statistical and psychometric research. (SLD)
Descriptors: Distractors (Tests), Information Theory, Multiple Choice Tests, Psychometrics
Speer, David C.; Greenbaum, Paul E. – Journal of Consulting and Clinical Psychology, 1995 (peer reviewed)
Currently there are at least four pretreatment-posttreatment (pre-post) difference score methods for determining client change. A fifth model, based on a random effects model and multiwave data, represents a growth curve approach and was hypothesized to be more sensitive to detecting significant (p<.05) change than pre-post models. Compares…
Descriptors: Behavior Change, Change, Counseling, Evaluation
McGrew, Kevin; Murphy, Suzanne – Journal of School Psychology, 1995 (peer reviewed)
Investigates the general factor and uniqueness characteristics of the individual tests of the Woodcock-Johnson Test of Cognitive Ability-Revised (WJTCA-R). Only 2 of the 19 WJTCA-R tests examined had low general factor loadings, while 2 had low uniqueness. All other tests had medium or high uniqueness. Discusses implications for clinical…
Descriptors: Academic Ability, Cognitive Ability, Intelligence, Intelligence Tests
Shapiro, Steven K.; And Others – Journal of School Psychology, 1995 (peer reviewed)
Examines the performance characteristics of 83 school-identified learning-disabled children on the Differential Ability Scales. Sixty percent showed a significant standard score discrepancy between the General Conceptual Ability and at least one achievement test. Implications regarding the educational diagnostic and intervention processes…
Descriptors: Academic Ability, Achievement Tests, Cognitive Ability, Intelligence
Goldstein, Harvey – Educational Measurement: Issues and Practice, 1994 (peer reviewed)
This article examines how psychometric models based on certain assumptions have come to be used counterproductively by many practitioners in ways that limit the kinds of conclusions that can be made. The general problem of the context's influence on performance is discussed, and some implications are drawn. (SLD)
Descriptors: Context Effect, Educational Research, Evaluation Methods, Measurement Techniques
Roos, Bertil; Hamilton, David – Assessment in Education: Principles, Policy and Practice, 2005
This paper considers alternative assessment, feedback and cybernetics. For more than 30 years, debates about the bi-polarity of formative and summative assessment have served as surrogates for discussions about the workings of the mind, the social implications of assessment and, as important, the role of instruction in the advancement of learning.…
Descriptors: Feedback, Formative Evaluation, Cybernetics, Constructivism (Learning)
Trafimow, David; Rice, Stephen – Psychological Review, 2008
People can use a variety of different strategies to perform tasks and these strategies all have two characteristics in common. First, they can be evaluated in comparison with either an absolute or a relative standard. Second, they can be used at varying levels of consistency. In the present article, the authors develop a general theory of task…
Descriptors: Behavior Theories, Performance, Scores, Performance Factors
van der Linden, Wim J. – Applied Psychological Measurement, 2006
Traditionally, error in equating observed scores on two versions of a test is defined as the difference between the transformations that equate the quantiles of their distributions in the sample and population of test takers. But it is argued that if the goal of equating is to adjust the scores of test takers on one version of the test to make…
Descriptors: Equated Scores, Evaluation Criteria, Models, Error of Measurement

