ERIC - Search Results

Showing all 7 results

Procedures for Selecting Items for Computerized Adaptive Tests.

Peer reviewed

Kingsbury, G. Gage; Zara, Anthony R. – Applied Measurement in Education, 1989

Several classical approaches and alternative approaches to item selection for computerized adaptive testing (CAT) are reviewed and compared. The study also describes procedures for constrained CAT that may be added to classical item selection approaches to allow them to be used for applied testing. (TJH)

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Test Length

Assessing the Dimensionality of Item Response Matrices with Small Sample Sizes and Short Test Lengths.

Peer reviewed

De Champlain, Andre; Gessaroli, Marc E. – Applied Measurement in Education, 1998

Type I error rates and rejection rates for three dimensionality assessment procedures were studied with data sets simulated to reflect short tests and small samples. Results show that the G-squared difference test (D. Bock, R. Gibbons, and E. Muraki, 1988) suffered from a severely inflated Type I error rate under all simulated conditions. (SLD)

Descriptors: Item Response Theory, Matrices, Sample Size, Simulation

Simultaneous Use of Multiple Answer Copying Indexes to Improve Detection Rates

Peer reviewed

Wollack, James A. – Applied Measurement in Education, 2006

Many of the currently available statistical indexes to detect answer copying lack sufficient power at small α levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…

Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size

Estimating the Reliability of a Test Containing Multiple Item Formats.

Peer reviewed

Qualls, Audrey L. – Applied Measurement in Education, 1995

Classically parallel, tau-equivalently parallel, and congenerically parallel models representing various degrees of part-test parallelism and their appropriateness for tests composed of multiple item formats are discussed. An appropriate reliability estimate for a test with multiple item formats is presented and illustrated. (SLD)

Descriptors: Achievement Tests, Estimation (Mathematics), Measurement Techniques, Test Format

The Effects of Test Length and Sample Size on the Reliability and Equating of Tests Composed of Constructed-Response Items.

Peer reviewed

Fitzpatrick, Anne R.; Yen, Wendy M. – Applied Measurement in Education, 2001

Examined the effects of test length and sample size on the alternate-forms reliability and equating of simulated mathematics tests composed of constructed-response items scaled using the two-parameter partial credit model. Results suggest that, to obtain acceptable reliabilities and accurate equated scores, tests should have at least 8 6-point…

Descriptors: Constructed Response, Equated Scores, Mathematics Tests, Reliability

An Investigation of the Differential Effort Received by Items on a Low-Stakes Computer-Based Test

Peer reviewed

Wise, Steven L. – Applied Measurement in Education, 2006

In low-stakes testing, the motivation levels of examinees are often a matter of concern to test givers because a lack of examinee effort represents a direct threat to the validity of the test data. This study investigated the use of response time to assess the amount of examinee effort received by individual test items. In 2 studies, it was found…

Descriptors: Computer Assisted Testing, Motivation, Test Validity, Item Response Theory

Customized Tests and Customized Norms.

Peer reviewed

Linn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991

Four main approaches to customized testing are described, and the valid uses and interpretations of the resulting scores are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)

Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum
