ERIC - Search Results

Showing 1 to 15 of 41 results

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

The Importance of Thinking Multivariately When Setting Subscale Cutoff Scores

Peer reviewed

Kroc, Edward; Olvera Astivia, Oscar L. – Educational and Psychological Measurement, 2022

Setting cutoff scores is one of the most common practices when using scales to aid in classification purposes. This process is usually done univariately where each optimal cutoff value is decided sequentially, subscale by subscale. While it is widely known that this process necessarily reduces the probability of "passing" such a test,…

Descriptors: Multivariate Analysis, Cutting Scores, Classification, Measurement
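The abstract's point that sequential, subscale-by-subscale cutoffs reduce the probability of "passing" can be illustrated with a minimal sketch. It assumes the subscales are statistically independent, which is a simplification and not a claim made by the article; the function name `joint_pass_probability` is hypothetical.

```python
from math import prod


def joint_pass_probability(marginal_pass_rates):
    """Probability of passing every subscale cutoff at once.

    Assumption (not from the article): subscale outcomes are independent,
    so the joint probability is the product of the marginal pass rates.
    """
    rates = list(marginal_pass_rates)
    if not rates:
        raise ValueError("at least one subscale pass rate is required")
    if any(not 0.0 <= r <= 1.0 for r in rates):
        raise ValueError("pass rates must lie in [0, 1]")
    return prod(rates)


# Three subscales, each passed by 90% of examinees when considered alone.
overall = joint_pass_probability([0.9, 0.9, 0.9])
print(round(overall, 3))
```

Under this assumption the overall pass rate falls below each single-subscale rate as soon as more than one cutoff is applied. Correlated subscales would change the size of the drop, which the independence shortcut cannot capture.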

Can High-Dimensional Questionnaires Resolve the Ipsativity Issue of Forced-Choice Response Formats?

Peer reviewed

Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021

Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…

Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring

Validation of Automated Scoring of Oral Reading

Peer reviewed

Balogh, Jennifer; Bernstein, Jared; Cheng, Jian; Van Moere, Alistair; Townshend, Brent; Suzuki, Masanori – Educational and Psychological Measurement, 2012

A two-part experiment is presented that validates a new measurement tool for scoring oral reading ability. Data collected by the U.S. government in a large-scale literacy assessment of adults were analyzed by a system called VersaReader that uses automatic speech recognition and speech processing technologies to score oral reading fluency. In the…

Descriptors: Reading Fluency, Measures (Individuals), Scoring, Reading Ability

The Single Administration Estimate of the Proportion of Agreement of a Proficiency Test Scored with a Latent Structure Model.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1981

This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)

Descriptors: Criterion Referenced Tests, Scoring, Test Reliability

A Comparison of Interscorer Agreement for the Comprehension, Similarities, and Vocabulary Subtests of the WISC and WISC-R.

Peer reviewed

Cuenot, Randall G.; Darbes, Alex – Educational and Psychological Measurement, 1982

Thirty-one clinical psychologists scored Comprehension, Similarities, and Vocabulary subtest items common to the Wechsler Intelligence Scale for Children (WISC) and the Wechsler Intelligence Scale for Children, Revised (WISC-R). The results on interrater scoring agreement suggest that the scoring of these subtests may be less subjective than…

Descriptors: Clinical Psychology, Intelligence Tests, Psychologists, Scoring

Factor Score Reliabilities and Domain Validities.

Peer reviewed

Gorsuch, Richard L. – Educational and Psychological Measurement, 1980

Kaiser and Michael reported a formula for factor scores giving an internal consistency reliability and its square root, the domain validity. Using this formula is inappropriate if variables are included which have trivial weights rather than salient weights for the factor for which the score is being computed. (Author/RL)

Descriptors: Factor Analysis, Factor Structure, Scoring Formulas, Test Reliability

The Comparative Validities of Three Scoring Systems Applied to an Objective Achievement Examination in Chemistry

Peer reviewed

Holmes, Roy A.; And Others – Educational and Psychological Measurement, 1974

Descriptors: Chemistry, Multiple Choice Tests, Scoring Formulas, Test Reliability

The Variances of Empirically Derived Option Scoring Weights

Peer reviewed

Echternacht, Gary – Educational and Psychological Measurement, 1975

Estimates for the variances of empirically determined scoring weights are given. It is also shown that test item writers should write distractors that discriminate on the criterion variable when this type of scoring is used. (Author)

Descriptors: Scoring, Statistical Analysis, Test Construction, Test Reliability

Estimating Scorer Agreement for Nominal Categorization Systems.

Peer reviewed

Burton, Nancy W. – Educational and Psychological Measurement, 1981

This study was concerned with selecting a measure of scorer agreement for use with the National Assessment of Educational Progress. The simple percent of agreement and Cohen's kappa were compared. It was concluded that Cohen's kappa does not add sufficient information to make its calculation worthwhile. (Author/BW)

Descriptors: Educational Assessment, Elementary Secondary Education, Quality Control, Scoring
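The two agreement measures compared in this study can be sketched as follows. This is a minimal illustration using the standard definitions of simple percent agreement and Cohen's kappa for two scorers; the function names and the toy ratings are hypothetical and are not taken from the study.

```python
from collections import Counter


def percent_agreement(scorer_a, scorer_b):
    """Proportion of cases on which two scorers assign the same category."""
    if len(scorer_a) != len(scorer_b) or not scorer_a:
        raise ValueError("ratings must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(scorer_a, scorer_b))
    return matches / len(scorer_a)


def cohens_kappa(scorer_a, scorer_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(scorer_a)
    observed = percent_agreement(scorer_a, scorer_b)
    counts_a, counts_b = Counter(scorer_a), Counter(scorer_b)
    categories = counts_a.keys() | counts_b.keys()
    # Chance agreement expected from each scorer's marginal category rates.
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Toy data: two scorers classify four responses into categories 0 and 1.
a = [1, 1, 0, 0]
b = [1, 0, 0, 0]
print(percent_agreement(a, b), cohens_kappa(a, b))
```

Percent agreement reports only the observed match rate, while kappa subtracts the agreement expected from the scorers' marginal category frequencies.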

PEP: A FORTRAN Program for Item Analysis

Peer reviewed

Carroll, C. Dennis – Educational and Psychological Measurement, 1976

A computer program for item evaluation, reliability estimation, and test scoring is described. The program contains a variable format procedure allowing flexible input of responses. Achievement tests and affective scales may be analyzed. (Author)

Descriptors: Achievement Tests, Affective Measures, Computer Programs, Item Analysis

The Reliability of a Promotional Job Knowledge Examination Scored by Number of Items Right and by Four Confidence Weighting Procedures and Its Corresponding Concurrent Validity Estimates Relative to Performance Criterion Ratings.

Peer reviewed

Friedland, David L.; Michael, William B. – Educational and Psychological Measurement, 1987

A sample of 153 male police officers were subjects in a test validation study with two objectives: (1) to compare reliability estimates of a 16-item objective achievement examination scored by the conventional items right formula and by four different procedures; and (2) to obtain comparative concurrent validity coefficients of scores arising from…

Descriptors: Achievement Tests, Concurrent Validity, Correlation, Police

Test Reliability and the Kuder-Richardson Formulas: Derivation from Probability Theory

Peer reviewed

Zimmerman, Donald W. – Educational and Psychological Measurement, 1972

Although a great deal of attention has been devoted over a period of years to the estimation of reliability from item statistics, there are still gaps in the mathematical derivation of the Kuder-Richardson results. The main purpose of this paper is to fill some of these gaps, using language consistent with modern probability theory. (Author)

Descriptors: Mathematical Applications, Probability, Scoring Formulas, Statistical Analysis
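For readers unfamiliar with the Kuder-Richardson results, a minimal sketch of the KR-20 coefficient for dichotomously scored items follows. It uses the textbook formula with the population variance of total scores, which is an assumed convention here; the function name `kr20` and the toy response matrix are hypothetical and do not come from the paper.

```python
from statistics import pvariance


def kr20(responses):
    """KR-20 reliability for a matrix of 0/1 item scores.

    responses: list of examinee rows, each a list of 0/1 item scores.
    Assumption: total-score variance is the population variance (divide by n).
    """
    n_examinees = len(responses)
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("KR-20 needs at least two items")
    totals = [sum(row) for row in responses]
    total_variance = pvariance(totals)
    if total_variance == 0:
        raise ValueError("KR-20 is undefined when total scores do not vary")
    # Sum of item variances p * (1 - p), where p is the proportion correct.
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_examinees
        sum_pq += p * (1.0 - p)
    return (n_items / (n_items - 1)) * (1.0 - sum_pq / total_variance)


# Toy data: four examinees, three items.
scores = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(kr20(scores))
```

Using the sample variance (divide by n - 1) instead would give a slightly different value, so the convention should be stated when reporting the coefficient.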

A Comparison of Empirical Differential Option Weighting Scoring Procedures as a Function of Inter-Item Correlation

Peer reviewed

Bejar, Issac I.; Weiss, David J. – Educational and Psychological Measurement, 1977

The reliabilities yielded by several differential option weighting scoring procedures were compared among themselves as well as against conventional testing. It was found that increases in reliability due to differential option weighting were a function of inter-item correlations. Suggestions for the implementation of differential option weighting…

Descriptors: Correlation, Forced Choice Technique, Item Analysis, Scoring Formulas

Reliability and Concurrent Validity of the Recreation Experience Preference Scales.

Peer reviewed

Tinsley, Howard E. A.; And Others – Educational and Psychological Measurement, 1981

Two procedures for scoring the Recreation Experience Preference scales were investigated using data obtained from respondents engaged in outdoor recreational activities. Both procedures yielded acceptable levels of reliability and concurrent validity. When time is unimportant, the scale score strategy is preferred over the domain score strategy.…

Descriptors: Methods, Outdoor Activities, Participant Satisfaction, Recreational Activities
