Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Models | 8 |
Test Reliability | 8 |
True Scores | 8 |
Correlation | 3 |
Measurement Techniques | 3 |
Test Construction | 3 |
Test Validity | 3 |
Testing | 3 |
Comparative Analysis | 2 |
Criterion Referenced Tests | 2 |
Prediction | 2 |
More ▼ |
Author
Attali, Yigal | 1 |
Bergquist, Constance | 1 |
Chang, Lei | 1 |
Cohen, Stanley H. | 1 |
Frayer, Dorothy A. | 1 |
Graham, Darol L. | 1 |
Hunter, John E. | 1 |
Kristof, Walter | 1 |
Ng, K. T. | 1 |
Roudabush, Glenn E. | 1 |
Publication Type
Reports - Research | 4 |
Journal Articles | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating

Kristof, Walter – Psychometrika, 1974
Descriptors: Models, Statistical Analysis, Test Reliability, Testing

Ng, K. T. – Educational and Psychological Measurement, 1974
This paper is aimed at demonstrating that Charles Spearman postulated neither a platonic true-error distinction nor a requirement for constant true scores under repeated measurement. (Author/RC)
Descriptors: Career Development, Correlation, Models, Test Reliability

Hunter, John E.; Cohen, Stanley H. – Psychometrika, 1974
Descriptors: Attitude Change, Attitudes, Comparative Analysis, Models
Attali, Yigal – ETS Research Report Series, 2007
This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…
Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)
Roudabush, Glenn E. – 1974
In this paper, several models for the psychometric nature of criterion-referenced tests are presented and results derived with implications for test construction, reliability and validity measures, and educational decision making. Both dichotomous and continuous underlying abilities to perform are considered. Illustrative data fitting both cases…
Descriptors: Criterion Referenced Tests, Decision Making, Evaluation Methods, Measurement Techniques
Frayer, Dorothy A. – 1971
A Paradigm for testing concept attainment, comprised of twelve tasks, was formulated. These tasks were hypothesized to form a cumulative hierarchy. Tests were constructed in mathematics and social studies using the paradigm. Data for these tests was analyzed by Kaiser's method for fitting a perfect simplex and Schonemann's method for fitting a…
Descriptors: Concept Formation, Correlation, Data Analysis, Error of Measurement
Chang, Lei – 1993
Equivalence in reliability and validity across 4-point and 6-point scales was assessed by fitting different measurement models through confirmatory factor analysis of a multitrait-multimethod covariance matrix. Responses to nine Likert-type items designed to measure perceived quantitative ability, self-perceived usefulness of quantitative…
Descriptors: Ability, Comparative Testing, Education Majors, Graduate Students
Graham, Darol L.; Bergquist, Constance – 1975
Two models were identified for criterion-referenced tests, one based on the assumption of a continuous achievement variable and the other assuming a dichotomous or binary variable. Several test characteristics were examined and contrasted for the two models, including the distribution of scores, establishment of a cutting score, test length, item…
Descriptors: Academic Achievement, Achievement Tests, Criterion Referenced Tests, Cutting Scores