ERIC - Search Results

Descriptor

Comparative Analysis	9
Mastery Tests	9
Test Items	9
Item Analysis	5
Statistical Analysis	4
Test Construction	4
Criterion Referenced Tests	3
Difficulty Level	3
Latent Trait Theory	3
Statistical Studies	3
Cutting Scores	2
Goodness of Fit	2
Knowledge Level	2
Mathematical Models	2
Scores	2
Statistical Distributions	2
Test Length	2
Test Theory	2
Academic Standards	1
Adaptive Testing	1
Basic Skills	1
Certification	1
Communication Skills	1
Computer Assisted Testing	1
Content Validity	1
More ▼

Source

Journal of Educational…

Author

Beard, Jacob G.	1
Frick, Theodore W.	1
Hambleton, Ronald K.	1
Harris, Deborah J.	1
Huynh, Huynh	1
Klein, Thomas W.	1
Melican, Gerald J.	1
Mills, Craig N.	1
Pettie, Allan L.	1
Phillips, Gary W.	1
Sarvela, Paul D.	1
Saunders, Joseph C.	1
Subkoviak, Michael J.	1
More ▼

Publication Type

Reports - Research	8
Speeches/Meeting Papers	6
Journal Articles	1
Reports - Evaluative	1

Education Level

Audience

Researchers

Location

South Carolina

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Accuracy of Two Procedures for Estimating Reliability of Mastery Tests.

Peer reviewed

Huynh, Huynh; Saunders, Joseph C. – Journal of Educational Measurement, 1980

Single administration (beta-binomial) estimates for the raw agreement index p and the corrected-for-chance kappa index in mastery testing are compared with those based on two test administrations in terms of estimation bias and sampling variability. Bias is about 2.5 percent for p and 10 percent for kappa. (Author/RL)

Descriptors: Comparative Analysis, Error of Measurement, Mastery Tests, Mathematical Models

A Short-Cut Statistic for Item Analysis of Mastery Tests: A Comparison of Three Procedures.

Download full text

Subkoviak, Michael J.; Harris, Deborah J. – 1984

This study examined three statistical methods for selecting items for mastery tests. One is the pretest-posttest method due to Cox and Vargas (1966); it is computationally simple, but has a number of serious limitations. The second is a latent trait method recommended by van der Linden (1981); it is computationally complex, but has a number of…

Descriptors: Comparative Analysis, Elementary Secondary Education, Item Analysis, Latent Trait Theory

A Comparison of an Expert Systems Approach to Computerized Adaptive Testing and an Item Response Theory Model.

Download full text

Frick, Theodore W. – 1991

Expert systems can be used to aid decisionmaking. A computerized adaptive test is one kind of expert system, although not commonly recognized as such. A new approach, termed EXSPRT, was devised that combines expert systems reasoning and sequential probability ratio test stopping rules. Two versions of EXSPRT were developed, one with random…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Expert Systems

A Preliminary Investigation of Three Compromise Methods for Establishing Cut-Off Scores.

Download full text

Mills, Craig N.; Melican, Gerald J. – 1987

The study compares three methods for establishing cut-off scores that effect a compromise between absolute cut-offs based on item difficulty and relative cut-offs based on expected passing rates. Each method coordinates these two types of information differently. The Beuk method obtains judges' estimates of an absolute cut-off and an expected…

Descriptors: Academic Standards, Certification, Comparative Analysis, Cutting Scores

Optimal Item Selection with Credentialing Examinations.

Download full text

Hambleton, Ronald K.; And Others – 1987

The study compared two promising item response theory (IRT) item-selection methods, optimal and content-optimal, with two non-IRT item selection methods, random and classical, for use in fixed-length certification exams. The four methods were used to construct 20-item exams from a pool of approximately 250 items taken from a 1985 certification…

Descriptors: Comparative Analysis, Content Validity, Cutting Scores, Difficulty Level

Characteristics Which Differentiate Criterion-Referenced from Norm-Referenced Tests.

Download full text

Klein, Thomas W. – 1990

Characteristics that distinguish criterion-referenced tests from their norm-referenced counterparts are discussed, including: the purposes that they are designed to serve; the characteristics of the types of items that they contain; and the manner in which they are developed. More specifically, the distinguishing characteristics include: reference…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Differences, Educational Assessment

Latent Trait Approach to Domain Score Estimation.

Phillips, Gary W. – 1982

This paper presents an introduction to the use of latent trait models for the estimation of domain scores. It was shown that these models provided an advantage over classical test theory and binomial error models in that unbiased estimates of true domain scores could be obtained even when items were not randomly selected from a universe of items.…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Estimation (Mathematics), Goodness of Fit

Discrimination Indices Commonly Used in Military Training Environments: Effects of Departures from Normal Distributions.

Download full text

Sarvela, Paul D. – 1986

Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests

A Comparison of Linear and Rasch Equating Results for Basic Skills Assessment Tests.

Beard, Jacob G.; Pettie, Allan L. – 1979

Test results from the Florida Educational Assessment of third and fifth grade communications and mathematics skills were used to compare linear and Rasch equating results. The samples consisted of over 5,000 cases for each grade and content area. The tests contained some items common to both the 1976 and 1977 test forms, but no fewer than 20…

Descriptors: Basic Skills, Communication Skills, Comparative Analysis, Difficulty Level