ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	7

Source

Applied Psychological…

Author

Reckase, Mark D.	2
Wyse, Adam E.	2
Brennan, Robert L.	1
Chan, Tsze	1
Cohen, Jon	1
Gao, Rui	1
Guo, Hongwen	1
Harris, Deborah J.	1
Jiang, Tao	1
Paek, Insu	1
Petersen, Nancy S.	1
Reese, Lynda M.	1
Seburn, Mary	1
Veldkamp, Bernard P.	1
Yang, Wen-Ling	1
van der Linden, Wim J.	1
More ▼

Publication Type

Journal Articles	10
Reports - Research	4
Reports - Evaluative	3
Information Analyses	2
Book/Product Reviews	1

Education Level

Higher Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
General Aptitude Test Battery	1
Law School Admission Test	1

What Works Clearinghouse Rating

Applied Psychological Measurement X

Showing all 10 results Save | Export

A Graphical Approach to Evaluating Equating Using Test Characteristic Curves

Peer reviewed

Direct link

Wyse, Adam E.; Reckase, Mark D. – Applied Psychological Measurement, 2011

An essential concern in the application of any equating procedure is determining whether tests can be considered equated after the tests have been placed onto a common scale. This article clarifies one equating criterion, the first-order equity property of equating, and develops a new method for evaluating equating that is linked to this…

Descriptors: Lawyers, Licensing Examinations (Professions), Testing Programs, Graphs

Accuracy of DIF Estimates and Power in Unbalanced Designs Using the Mantel-Haenszel DIF Detection Procedure

Peer reviewed

Direct link

Paek, Insu; Guo, Hongwen – Applied Psychological Measurement, 2011

This study examined how much improvement was attainable with respect to accuracy of differential item functioning (DIF) measures and DIF detection rates in the Mantel-Haenszel procedure when employing focal and reference groups with notably unbalanced sample sizes where the focal group has a fixed small sample which does not satisfy the minimum…

Descriptors: Test Bias, Accuracy, Reference Groups, Investigations

The Potential Impact of Not Being Able to Create Parallel Tests on Expected Classification Accuracy

Peer reviewed

Direct link

Wyse, Adam E. – Applied Psychological Measurement, 2011

In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…

Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics

Consistent Estimation of Rasch Item Parameters and Their Standard Errors under Complex Sample Designs

Peer reviewed

Direct link

Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008

U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…

Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation

Invariance of Score Linkings across Gender Groups for Forms of a Testlet-Based College-Level Examination Program Examination

Peer reviewed

Direct link

Yang, Wen-Ling; Gao, Rui – Applied Psychological Measurement, 2008

This study investigates whether the functions linking number-correct scores to the College-Level Examination Program (CLEP) scaled scores remain invariant over gender groups, using test data on the 16 testlet-based forms of the CLEP College Algebra exam. To be consistent with the operational practice, linking of various test forms to a common…

Descriptors: Mathematics Tests, Algebra, Item Response Theory, Testing Programs

A Discussion of Population Invariance

Peer reviewed

Direct link

Brennan, Robert L. – Applied Psychological Measurement, 2008

The discussion here covers five articles that are linked in the sense that they all treat population invariance. This discussion of population invariance is a somewhat broader treatment of the subject than simply a discussion of these five articles. In particular, occasional reference is made to publications other than those in this issue. The…

Descriptors: Advanced Placement, Law Schools, Science Achievement, Achievement Tests

A Discussion of Population Invariance of Equating

Peer reviewed

Direct link

Petersen, Nancy S. – Applied Psychological Measurement, 2008

This article discusses the five studies included in this issue. Each article addressed the same topic, population invariance of equating. They all used data from major standardized testing programs, and they all used essentially the same statistics to evaluate their results, namely, the root mean square difference and root expected mean square…

Descriptors: Testing Programs, Standardized Tests, Equated Scores, Evaluation Methods

An Integer Programming Approach to Item Bank Design.

Peer reviewed

van der Linden, Wim J.; Veldkamp, Bernard P.; Reese, Lynda M. – Applied Psychological Measurement, 2000

Presents an integer programming approach to item bank design that can be used to calculate an optimal blueprint for an item bank in order to support an existing testing program. Demonstrates the approach empirically using an item bank designed for the Law School Admission Test. (SLD)

Descriptors: Item Banks, Item Response Theory, Test Construction, Testing Programs

Book Review: "Fairness in Employment Testing: Validity Generalization, Minority Issues, and the General Aptitude Test Battery".

Peer reviewed

Reckase, Mark D. – Applied Psychological Measurement, 1990

This book summarizes the evaluation by the Committee on the General Aptitude Test Battery (GATB) of the National Research Council of the U.S. Employment Service's use of the GATB. The book serves professional users, policymakers, and students of psychometrics because of its clear and concise review of important issues. (SLD)

Descriptors: Adults, Book Reviews, Culture Fair Tests, Minority Groups

Effects of Passage and Item Scrambling on Equating Relationships.

Peer reviewed

Harris, Deborah J. – Applied Psychological Measurement, 1991

Effects of passage and item-scrambling on equipercentile and item-response theory equating were investigated using 2 scrambled versions of the American College Testing Program Assessment for approximately 25,000 examinees. Results indicate that using a base-form conversion table with a scrambled form affects the individual examinee level. (SLD)

Descriptors: College Entrance Examinations, Comparative Testing, Context Effect, Equated Scores

Testing Programs	10
Item Response Theory	8
Equated Scores	5
Standardized Tests	4
College Entrance Examinations	3
Evaluation Methods	3
Psychometrics	3
Achievement Tests	2
Computation	2
Cutting Scores	2
Gender Differences	2
Investigations	2
Law Schools	2
National Programs	2
Racial Differences	2
Sampling	2
Science Achievement	2
Test Bias	2
Test Construction	2
Test Format	2
Accuracy	1
Adults	1
Advanced Placement	1
Algebra	1
Book Reviews	1
More ▼