Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Statistical Analysis | 40 |
Testing Problems | 40 |
Test Validity | 35 |
Test Construction | 13 |
Test Reliability | 13 |
Test Bias | 11 |
Item Analysis | 9 |
Scores | 9 |
Measurement Techniques | 7 |
Testing | 7 |
Achievement Tests | 6 |
More ▼ |
Source
Didakometry | 1 |
ETS Research Report Series | 1 |
Education and Urban Society | 1 |
Educational and Psychological… | 1 |
Journal of Educational… | 1 |
Journal of Educational… | 1 |
NCME Measurement in Education | 1 |
Psychometrika | 1 |
Author
Publication Type
Education Level
Audience
Researchers | 3 |
Laws, Policies, & Programs
Assessments and Surveys
General Aptitude Test Battery | 2 |
Armed Services Vocational… | 1 |
Metropolitan Achievement Tests | 1 |
Metropolitan Readiness Tests | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017
Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language
Stout, William – 1984
An important problem in psychological test theory is the development of a sound method for determining whether a test which purports to measure the level of a certain ability is, in reality, significantly contaminated by one or more other abilities displayed by persons taking the test. Because of the large number of private and governmental…
Descriptors: Latent Trait Theory, Statistical Analysis, Statistical Distributions, Test Validity
WEITZ, HENRY – 1967
COUNSELORS OFTEN ADMINISTER TESTS OF QUESTIONABLE VALIDITY. IN RELIABILITY STUDIES, EVERY PRECAUTION IS TAKEN TO STABILIZE THE STIMULUS SITUATION. IN ASSESSING VALIDITY, CONCERN CENTERS ON BEHAVIOR UNDER DIFFERENT STIMULUS CONDITIONS. CRONBACH'S THEORETICAL LIMIT FOR A VALIDITY COEFFICIENT OF A TEST IS THE SQUARE ROOT OF THE RELIABILITY…
Descriptors: Aptitude Tests, Career Counseling, Counseling, Counseling Objectives

Echternacht, Gary – Educational and Psychological Measurement, 1974
Descriptors: Evaluation Criteria, Probability, Statistical Analysis, Test Bias
Frary, Robert B. – 1982
Three measures of person-fit (the extent to which an examinee's response pattern on a multiple-choice test is consistent with his ability as estimated by total score) were computed for students taking classroom tests under 12 different instructors at a comprehensive university. Supplementary questions on each test inquired concerning students'…
Descriptors: Higher Education, Multiple Choice Tests, Predictive Validity, Reliability

Linn, Robert L. – Journal of Educational Measurement, 1984
The common approach to studies of predictive bias is analyzed within the context of a conceptual model in which predictors and criterion measures are viewed as fallible indicators of idealized qualifications. (Author/PN)
Descriptors: Certification, Models, Predictive Measurement, Predictive Validity

Akemann, Charles A.; And Others – Journal of Educational Statistics, 1983
Generally, this paper aims to: (1) provide clarification, quantification, and some mathematical analysis to the statistical problem of restricted range in a college admissions situation; and (2) discuss various questions related to the problem of selection strategies. (Author/PN)
Descriptors: Admission Criteria, College Admission, Correlation, Higher Education
Frederiksen, Norman – 1976
A number of different ways of ascertaining whether or not a test measures the same thing in different cultures are examined. Methods range from some that are obvious and simple to those requiring statistical and psychological sophistication. Simpler methods include such things as having candidates "think aloud" and interviewing them about how they…
Descriptors: Analysis of Covariance, Culture Fair Tests, Factor Analysis, Item Analysis
Kapes, Jerome T. – 1975
Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus.) As part of a ten-year longitudinal study of Vocational Development being conducted in the Department of Vocational Education at The…
Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores
Larsson, Bernt – Didakometry, 1974
Subjects are asked to answer six questions, partly with a frequency and partly by marking a verbally anchored scale with five categories. Some univariate and multivariate analyses are performed to elucidate the relations between variables with the two different modes of response. Although there are similarities in results for the two types of…
Descriptors: Measurement Techniques, Measures (Individuals), Rating Scales, Responses

Tittle, Carol Kehr – Education and Urban Society, 1975
The purpose of this paper is to describe a set of procedures, that, when carried out, permit the conclusion that a test is a fair measure from the standpoint of specific sub-groups within a test population. A fair test is defined as a test for which a set of data-collection procedures have been carried out and the results reported. (Author/JM)
Descriptors: Academic Achievement, Achievement Tests, Evaluation Criteria, Measurement Techniques
Miller, M. David; Burstein, Leigh – 1981
Two studies are presented in this report. The first is titled "Empirical Studies of Multilevel Approaches to Test Development and Interpretation: Measuring Between-Group Differences in Instruction." Because of a belief that schooling does affect student achievement, researchers have questioned the empirical and measurement techniques…
Descriptors: Error Patterns, Evaluation Methods, Item Analysis, Models
Morse, David T.; Morse, Linda W. – 1976
Performance testing often entails the usage of expensive, time-consuming measures in the quest for determining the level of performance on some desired behavior. It is concluded that a generalizability theory approach to dealing with departures from reality in testing can aid in the establishment of empirically-based choices of measurement…
Descriptors: Cost Effectiveness, Decision Making, Mathematical Models, Measurement Techniques
Green, Donald Ross – 1976
During the past few years the problem of bias in testing has become an increasingly important issue. In most research, bias refers to the fair use of tests and has thus been defined in terms of an outside criterion measure of the performance being predicted by the test. Recently however, there has been growing interest in assessing bias when such…
Descriptors: Achievement Tests, Item Analysis, Mathematical Models, Minority Groups
Kuntz, Patricia – 1982
The quality of mathematics multiple choice items and their susceptibility to test wiseness were examined. Test wiseness was defined as "a subject's capacity to utilize the characteristics and formats of the test and/or test taking situation to receive a high score." The study used results of the Graduate Record Examinations Aptitude Test (GRE) and…
Descriptors: Cues, Item Analysis, Multiple Choice Tests, Psychometrics