ERIC - Search Results

Descriptor

Statistical Analysis	7
Test Reliability	7
Higher Education	3
Measurement	2
Rating Scales	2
Test Construction	2
Testing Problems	2
Bayesian Statistics	1
Classification	1
Comparative Analysis	1
Creativity Tests	1
Cutting Scores	1
Decision Making	1
Equated Scores	1
Error of Measurement	1
Generalization	1
Item Sampling	1
Latent Trait Theory	1
Least Squares Statistics	1
Mathematical Formulas	1
Mathematical Models	1
Maximum Likelihood Statistics	1
Meta Analysis	1
Predictive Validity	1
Response Style (Tests)	1
More ▼

Source

Applied Psychological…

Author

Brennan, Robert L.	1
Frederiksen, Norman	1
Kaiser, Henry F.	1
Lockwood, Robert E.	1
Mellenbergh, Gideon J.	1
Millsap, Roger E.	1
Rounds, James B., Jr.	1
Serlin, Ronald C.	1
Ward, William C.	1
Weiss, David J., Ed.	1
van der Linden, Wim J.	1
More ▼

Publication Type

Journal Articles	4
Reports - Research	2
Collected Works - Serials	1
Reports - Evaluative	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	1
Minnesota Importance…	1

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Tolerance Intervals: Alternatives to Credibility Intervals in Validity Generalization Research.

Peer reviewed

Millsap, Roger E. – Applied Psychological Measurement, 1988

Two new methods for constructing a credibility interval (CI)--an interval containing a specified proportion of true validity description--are discussed, from a frequentist perspective. Tolerance intervals, unlike the current method of constructing the CI, have performance characteristics across repeated applications and may be useful in validity…

Descriptors: Bayesian Statistics, Meta Analysis, Statistical Analysis, Test Reliability

A Comparison of the Nedelsky and Angoff Cutting Score Procedures Using Generalizability Theory.

Peer reviewed

Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980

Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)

Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability

Comparability of Multiple Rank Order and Paired Comparison Methods.

Peer reviewed

Rounds, James B., Jr.; And Others – Applied Psychological Measurement, 1978

Two studies compared multiple rank order and paired comparison methods in terms of psychometric characteristics and user reactions. Individual and group item responses, preference counts, and Thurstone normal transform scale values obtained by the multiple rank order method were found to be similar to those obtained by paired comparisons.…

Descriptors: Higher Education, Measurement, Rating Scales, Response Style (Tests)

Contributions to the Method of Paired Comparisons.

Peer reviewed

Kaiser, Henry F.; Serlin, Ronald C. – Applied Psychological Measurement, 1978

A least-squares solution for the method of paired comparisons is given. The approach provokes a theorem regarding the amount of data necessary and sufficient for a solution to be obtained. A measure of the internal consistency of the least-squares fit is developed. (Author/CTM)

Descriptors: Higher Education, Least Squares Statistics, Mathematical Models, Measurement

Problems, Perspectives, and Practical Issues in Equating.

Peer reviewed

Weiss, David J., Ed. – Applied Psychological Measurement, 1987

Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)

Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis

Measures for the Study of Creativity in Scientific Problem-Solving

Peer reviewed

Frederiksen, Norman; Ward, William C. – Applied Psychological Measurement, 1978

A set of Tests of Scientific Thinking were developed for possible use as criterion measures in research on creativity. Scores on the tests describe both quality and quantity of ideas produced in formulating hypotheses, evaluating proposals, solving methodological problems, and devising methods for measuring constructs. (Author/CTM)

Descriptors: Creativity Tests, Higher Education, Item Sampling, Predictive Validity

The Internal and External Optimality of Decisions Based on Tests.

Peer reviewed

Mellenbergh, Gideon J.; van der Linden, Wim J. – Applied Psychological Measurement, 1979

For six tests, coefficient delta as an index for internal optimality is computed. Internal optimality is defined as the magnitude of risk of the decision procedure with respect to the true score. Results are compared with an alternative index (coefficient kappa) for assessing the consistency of decisions. (Author/JKS)

Descriptors: Classification, Comparative Analysis, Decision Making, Error of Measurement