ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	8

Source

Applied Psychological…

Publication Type

Journal Articles	48
Reports - Research	23
Reports - Evaluative	16
Reports - Descriptive	4
Opinion Papers	3
Collected Works - Serials	2
Information Analyses	2
Reports - General	2
Collected Works - General	1
Tests/Questionnaires	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

West Germany	2
Australia	1
Netherlands	1
Sweden	1

Laws, Policies, & Programs

Assessments and Surveys

California Psychological…	2
Graduate Record Examinations	2
Armed Forces Qualification…	1
Armed Services Vocational…	1
Bem Sex Role Inventory	1
Defining Issues Test	1
Edwards Personal Preference…	1
Hidden Figures Test	1
Minnesota Importance…	1
Minnesota Multiphasic…	1
Rod and Frame Test	1
Sixteen Personality Factor…	1
Stanford Binet Intelligence…	1
Strong Campbell Interest…	1
Washington University…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Applied Psychological Measurement X

Showing 1 to 15 of 72 results Save | Export

Comparing the Performance of Five Multidimensional CAT Selection Procedures with Different Stopping Rules

Peer reviewed

Direct link

Yao, Lihua – Applied Psychological Measurement, 2013

Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Dynamic Problem Solving: A New Assessment Perspective

Peer reviewed

Direct link

Greiff, Samuel; Wustenberg, Sascha; Funke, Joachim – Applied Psychological Measurement, 2012

This article addresses two unsolved measurement issues in dynamic problem solving (DPS) research: (a) unsystematic construction of DPS tests making a comparison of results obtained in different studies difficult and (b) use of time-intensive single tasks leading to severe reliability problems. To solve these issues, the MicroDYN approach is…

Descriptors: Problem Solving, Tests, Measurement, Structural Equation Models

Detecting Halo Effects in Performance-Based Examinations

Peer reviewed

Direct link

Bechger, Timo M.; Maris, Gunter; Hsiao, Ya Ping – Applied Psychological Measurement, 2010

The main purpose of this article is to demonstrate how halo effects may be detected and quantified using two independent ratings of the same person. A practical illustration is given to show how halo effects can be avoided. (Contains 2 tables, 7 figures, and 2 notes.)

Descriptors: Performance Based Assessment, Test Reliability, Test Length, Language Tests

A Clarification of the Effects of Rapid Guessing on Coefficient [Alpha]: A Note on Attali's "Reliability of Speeded Number-Right Multiple-Choice Tests"

Peer reviewed

Direct link

Wise, Steven L.; DeMars, Christine E. – Applied Psychological Measurement, 2009

Attali (2005) recently demonstrated that Cronbach's coefficient [alpha] estimate of reliability for number-right multiple-choice tests will tend to be deflated by speededness, rather than inflated as is commonly believed and taught. Although the methods, findings, and conclusions of Attali (2005) are correct, his article may inadvertently invite a…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Computation

A Critique of Raju and Oshima's Prophecy Formulas for Assessing the Reliability of Item Response Theory-Based Ability Estimates

Peer reviewed

Direct link

Wang, Wen-Chung – Applied Psychological Measurement, 2008

Raju and Oshima (2005) proposed two prophecy formulas based on item response theory in order to predict the reliability of ability estimates for a test after change in its length. The first prophecy formula is equivalent to the classical Spearman-Brown prophecy formula. The second prophecy formula is misleading because of an underlying false…

Descriptors: Test Reliability, Item Response Theory, Computation, Evaluation Methods

Multinomial and Compound Multinomial Error Models for Tests with Complex Item Scoring

Peer reviewed

Direct link

Lee, Won-Chan – Applied Psychological Measurement, 2007

This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…

Descriptors: Simulation, Error of Measurement, Scoring, Test Items

A Zero-One Programming Approach to Gulliksen's Matched Random Subtests Method.

Peer reviewed

van der Linden, Wim J.; Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1988

Gulliksen's matched random subtests method is a graphical method to split a test into parallel test halves, allowing maximization of coefficient alpha as a lower bound to the classical test reliability coefficient. This problem is formulated as a zero-one programing problem solvable by algorithms that already exist. (TJH)

Descriptors: Algorithms, Equations (Mathematics), Programing, Test Reliability

Reliability of Total Test Scores When Considered as Ordinal Measurements

Peer reviewed

Direct link

Biswas, Ajoy Kumar – Applied Psychological Measurement, 2006

This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…

Descriptors: True Scores, Test Theory, Test Reliability, Scores

Comparison of the Null Distributions of Weighted Kappa and the C Ordinal Statistic

Peer reviewed

Cicchetti, Domenic V.; Fleiss, Joseph L. – Applied Psychological Measurement, 1977

The weighted kappa coefficient is a measure of interrater agreement when the relative seriousness of each possible disagreement can be quantified. This monte carlo study demonstrates the utility of the kappa coefficient for ordinal data. Sample size is also briefly discussed. (Author/JKS)

Descriptors: Mathematical Models, Rating Scales, Reliability, Sampling

Group Dependence of Some Reliability Indices for Mastery Tests.

Peer reviewed

Divgi, D. R. – Applied Psychological Measurement, 1980

The dependence of reliability indices for mastery tests on mean and cutoff scores was examined in the case of three decision-theoretic indices. Dependence of kappa on mean and cutoff scores was opposite to that of the proportion of correct decisions, which was linearly related to average threshold loss. (Author/BW)

Descriptors: Classification, Cutting Scores, Mastery Tests, Test Reliability

Tolerance Intervals: Alternatives to Credibility Intervals in Validity Generalization Research.

Peer reviewed

Millsap, Roger E. – Applied Psychological Measurement, 1988

Two new methods for constructing a credibility interval (CI)--an interval containing a specified proportion of true validity description--are discussed, from a frequentist perspective. Tolerance intervals, unlike the current method of constructing the CI, have performance characteristics across repeated applications and may be useful in validity…

Descriptors: Bayesian Statistics, Meta Analysis, Statistical Analysis, Test Reliability

A Comparison of the Nedelsky and Angoff Cutting Score Procedures Using Generalizability Theory.

Peer reviewed

Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980

Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)

Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability

Some Comments on the Relation between Reliability and Statistical Power.

Peer reviewed

Humphreys, Lloyd G.; Drasgow, Fritz – Applied Psychological Measurement, 1989

Issues arising from difference scores with zero reliability that nevertheless allow a powerful test of change are discussed. Issues include the appropriateness of underlying statistical models for psychological data and the relationship between difference scores and power. Increases in reliability always increase power for a fixed effect size.…

Descriptors: Goodness of Fit, Mathematical Models, Power (Statistics), Psychometrics

Inter-Inventory Predictability and Content Overlap of the 16PF and the CPI

Peer reviewed

Campbell, John B.; Chun, Ki-Taek – Applied Psychological Measurement, 1977

A multiple regression approach is used to assess the feasibility of reciprocal prediction between the Sixteen Personality Factor Questionnaire scales and the California Psychological Inventory scales (i.e., the prediction of each 16PF scale from the CPI scales and of each CPI scale from the 16PF scales). (RC)

Descriptors: Correlation, Multiple Regression Analysis, Personality Measures, Prediction

Estimating Reliabilities of Computerized Adaptive Tests.

Peer reviewed

Divgi, D. R. – Applied Psychological Measurement, 1989

Two methods for estimating the reliability of a computerized adaptive test (CAT) without using item response theory are presented. The data consist of CAT and paper-and-pencil scores from identical or equivalent samples, and scores for all examinees on one or more covariates, using the Armed Services Vocational Aptitude Battery. (TJH)

Descriptors: Adaptive Testing, Computer Assisted Testing, Estimation (Mathematics), Predictive Validity

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Divgi, D. R.	2
Humphreys, Lloyd G.	2
Wang, Wen-Chung	2
Whitely, Susan E.	2
van der Linden, Wim J.	2
Backteman, G.	1
Baker, A. Harvey	1
Barnes, Janet L.	1
Bechger, Timo M.	1
Bejar, Isaac I.	1
Biswas, Ajoy Kumar	1
Boekkooi-Timminga, Ellen	1
Bray, James H.	1
Brennan, Robert L.	1
Budescu, David V.	1
Burisch, Matthias	1
Campbell, John B.	1
Chun, Ki-Taek	1
Cicchetti, Domenic V.	1
Claudy, John G.	1
Cliff, Norman	1
Cohen, Allan S.	1
Conger, Anthony J.	1
Cudeck, Robert	1
More ▼

Test Reliability	72
Higher Education	22
Test Validity	20
Test Construction	14
Test Items	13
Error of Measurement	10
Mathematical Models	9
Rating Scales	9
Item Analysis	8
Adaptive Testing	7
Computer Assisted Testing	7
Item Response Theory	7
Psychometrics	7
Scores	7
Statistical Analysis	7
Equations (Mathematics)	6
Foreign Countries	6
Personality Measures	6
Response Style (Tests)	6
Scoring	6
Scoring Formulas	6
Test Theory	6
Testing Problems	6
Cognitive Processes	5
Difficulty Level	5
More ▼