Publication Date
In 2025 | 4 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 19 |
Since 2016 (last 10 years) | 35 |
Since 2006 (last 20 years) | 57 |
Descriptor
Test Validity | 165 |
Test Reliability | 68 |
Test Construction | 52 |
Validity | 52 |
Higher Education | 36 |
Test Items | 35 |
Predictive Validity | 33 |
Scores | 33 |
Item Analysis | 31 |
Test Interpretation | 30 |
Test Bias | 29 |
More ▼ |
Source
Journal of Educational… | 252 |
Author
Publication Type
Education Level
Higher Education | 6 |
Postsecondary Education | 6 |
Secondary Education | 4 |
Middle Schools | 3 |
Elementary Education | 2 |
Elementary Secondary Education | 2 |
Junior High Schools | 2 |
Grade 7 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Audience
Researchers | 7 |
Practitioners | 2 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Gross, Alan L.; Shulman, Vivian – Journal of Educational Measurement, 1980
The suitability of the beta binomial test model for criterion referenced testing was investigated, first by considering whether underlying assumptions are realistic, and second, by examining the robustness of the model. Results suggest that the model may have practical value. (Author/RD)
Descriptors: Criterion Referenced Tests, Goodness of Fit, Higher Education, Item Sampling

Sawyer, Richard – Journal of Educational Measurement, 1996
Decision theory is a useful method for assessing the effectiveness of the components of a course placement system. The effectiveness of placement tests or other variables in identifying underprepared students is described by the conditional probability of success in a standard course. Estimating the conditional probability of success is discussed.…
Descriptors: College Students, Estimation (Mathematics), Higher Education, Mathematical Models

Lennon, Roger T. – Journal of Educational Measurement, 1975
Reviews the 1974 Standards, an updating serving as a guide to test making and publishing, and training of persons for these endeavors. (DEP)
Descriptors: Educational Testing, Psychological Testing, Scoring, Standards

Nevo, Baruch – Journal of Educational Measurement, 1985
A literature review and a proposed means of measuring face validity, a test's appearance of being valid, are presented. Empirical evidence from examinees' perceptions of a college entrance examination support the reliability of measuring face validity. (GDC)
Descriptors: College Entrance Examinations, Evaluation Methods, Evaluators, Foreign Countries

Garg, Rashmi; And Others – Journal of Educational Measurement, 1986
For the purpose of obtaining data to use in test development, multiple matrix sampling plans were compared to examinee sampling plans. Data were simulated for examinees, sampled from a population with a normal distribution of ability, responding to items selected from an item universe. (Author/LMO)
Descriptors: Difficulty Level, Monte Carlo Methods, Sampling, Statistical Studies

Stenner, A. Jackson; And Others – Journal of Educational Measurement, 1983
In an attempt to restore the symmetry and balance between the study of person and item variation, this paper presents a novel methodology construct specification equations, which allows one to ascertain from the lawful behavior of items what an instrument is measuring. (Author/PN)
Descriptors: Measurement Objectives, Measurement Techniques, Research Methodology, Test Construction

Humphreys, Lloyd G.; Taber, Thomas – Journal of Educational Measurement, 1973
Preliminary factor analyses of predictor tests in advantaged and disadvantaged groups is recommended as a way of forming a priori expectations concerning validities of the predictors to guide both use and research. (Authors)
Descriptors: Ability Identification, Disadvantaged, Factor Analysis, Individual Differences

Collet, Leverne S. – Journal of Educational Measurement, 1971
The purpose of this paper was to provide an empirical test of the hypothesis that elimination scores are more reliable and valid than classical corrected-for-guessing scores or weighted-choice scores. The evidence presented supports the hypothesized superiority of elimination scoring. (Author)
Descriptors: Evaluation, Guessing (Tests), Multiple Choice Tests, Scoring Formulas

Schmidt, William H. – Journal of Educational Measurement, 1983
A conception of invalidity as bias is related to content validity for standardized achievement tests. A method of estimating content bias for each of three content domains (a priori, curricular, and instructional) based on the specification of a content taxonomy is also proposed. (Author/CM)
Descriptors: Achievement Tests, Content Analysis, Evaluation Methods, Instruction

Quellmalz, Edys S.; And Others – Journal of Educational Measurement, 1982
This study investigates the construct validity of profiles of high school students' (n=92) writing competence obtained from tasks designed to be parallel on all dimensions but discourse mode (expository or narrative) and response mode (direct or indirect). (Author/PN)
Descriptors: Discourse Analysis, Expository Writing, Grade 12, Profiles

Ebel, Robert L. – Journal of Educational Measurement, 1982
Reasonable and practical solutions to two major problems confronting the developer of any test of educational achievement (what to measure and how to measure it) are proposed, defended, and defined. (Author/PN)
Descriptors: Measurement Techniques, Objective Tests, Test Construction, Test Items

Anderson, Ronald E.; And Others – Journal of Educational Measurement, 1982
Findings on alternative procedures for evaluating measures of achievement in individual data packages at the National Assessment of Educational Progress are presented with their methodological implications. The need for secondary analysts to be aware of the organization of the data, and positive and negative features are discussed. (Author/CM)
Descriptors: Achievement, Databases, Educational Assessment, Elementary Secondary Education

Winne, Philip H.; Belfry, M. Joan – Journal of Educational Measurement, 1982
This review of issues about correcting for attenuation concludes that the basic difficulty lies in being able to identify and equate sources of variance in estimates of validity and reliability. Recommendations are proposed for cautious use of correction for attenuation. (Author/CM)
Descriptors: Correlation, Error of Measurement, Research Methodology, Statistical Analysis

Wardrop, James L.; And Others – Journal of Educational Measurement, 1982
A structure for describing different approaches to testing is generated by identifying five dimensions along which tests differ: test uses, item generation, item revision, assessment of precision, and validation. These dimensions are used to profile tests of reading comprehension. Only norm-referenced achievement tests had an inference system…
Descriptors: Achievement Tests, Comparative Analysis, Educational Testing, Models

Ward, William C.; And Others – Journal of Educational Measurement, 1980
Free response and machine-scorable versions of a test called Formulating Hypotheses were compared with respect to construct validity. Results indicate that the different forms involve different cognitive processes and measure different qualities. (Author/JKS)
Descriptors: Cognitive Processes, Cognitive Tests, Higher Education, Personality Traits