NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 5,686 to 5,700 of 9,547 results Save | Export
Stansfield, Charles W.; And Others – 1990
The development and validation of the Spanish-English Verbatim Translation Exam (SEVTE) is described. The test is for use by the Federal Bureau of Investigation (FBI) in the selection of applicants for the positions of Language Specialist or Contract Linguist. The report is divided into eight sections. Section 1 describes the need for the test,…
Descriptors: Content Validity, English, Language Proficiency, Language Tests
Wolfram, Walt – 1990
Questions are addressed that focus on why lower class and minority group test takers score lower on standardized tests than their middle class Anglo counterparts. The questions include the following: (1) In what ways can dialect differences affect testing? (2) How can dialect differences directly affect a test of language? (3) Shouldn't standard…
Descriptors: Dialects, English, Language Tests, Lower Class
Holland, Paul W.; Thayer, Dorothy T. – 1985
An alternative definition has been developed of the delta scale of item difficulty used at Educational Testing Service. The traditional delta scale uses an inverse normal transformation based on normal ogive models developed years ago. However, no use is made of this fact in typical uses of item deltas. It is simply one way to make the probability…
Descriptors: Difficulty Level, Error Patterns, Estimation (Mathematics), Item Analysis
Sommer, Thomas W. – 1986
A model was developed to provide a uniform method for vocational and technical education content experts to develop test items (written questions and performance measures) that are congruous with course-level exit competencies. The model is essentially a closed-system approach in that specific action verbs must be identified and operationally…
Descriptors: Course Objectives, Models, Objective Tests, Performance Tests
Melican, Gerald; Thomas, Nancy – 1984
Setting standards for the purpose of certification is frequently performed using judgmental techniques such as the Angoff method. This study was performed to identify types of items that judges find hard to rate accurately, that is, types of items on which examinees perform differently than predicted by the judges. Once identified these item types…
Descriptors: Certification, Cutting Scores, Difficulty Level, Minimum Competency Testing
Doolittle, Allen E. – 1983
The stability of selected indices for detecting differential item performance (item bias), from one randomly equivalent sample to another, is addressed. Some recent research has criticized these indices as too unreliable for utility in measuring bias in achievement test items. Using data from a national testing of the ACT Assessment, however, this…
Descriptors: Black Students, Item Analysis, Racial Factors, Reliability
Shannon, Gregory A. – 1983
Rescoring of Center for Occupational and Professional Assessment objective-referenced tests is decided largely by content experts selected by client organizations. A few of the test items, statistically flagged for review, are not rescored. Some of this incongruence could be due to the use of the biserial correlation (r-biserial) as an…
Descriptors: Adults, Criterion Referenced Tests, Item Analysis, Occupational Tests
Boekkooi-Timminga, Ellen – 1989
The construction of parallel tests from item response theory (IRT) based item banks is discussed. Tests are considered parallel whenever their information functions are identical. After the methods for constructing parallel tests are considered, the computational complexity of 0-1 linear programming and the heuristic procedure applied are…
Descriptors: Heuristics, Item Banks, Latent Trait Theory, Mathematical Models
Engelen, Ron J. H.; And Others – 1988
Fisher's information measure for the item difficulty parameter in the Rasch model and its marginal and conditional formulations are investigated. It is shown that expected item information in the unconditional model equals information in the marginal model, provided the assumption of sampling examinees from an ability distribution is made. For the…
Descriptors: Ability, Difficulty Level, Foreign Countries, Latent Trait Theory
PDF pending restoration PDF pending restoration
Boekkooi-Timminga, Ellen – 1986
Nine methods for automated test construction are described. All are based on the concepts of information from item response theory. Two general kinds of methods for the construction of parallel tests are presented: (1) sequential test design; and (2) simultaneous test design. Sequential design implies that the tests are constructed one after the…
Descriptors: Algorithms, Computer Assisted Testing, Foreign Countries, Item Banks
Whitney, Douglas R.; And Others – 1985
This research brief summarizes the available reliability and validity data available in, but spread throughout, a number of General Educational Development (GED) Testing Service publications. A section on reliability discusses how to determine reliability of a test's scores and two ways of assessing the reliability of a test--internal consistency…
Descriptors: Adult Education, High School Equivalency Programs, Item Analysis, Scores
Lord, Frederic M. – 1982
Explored are two theoretical approaches that attempt to cope with omitted responses, that is, when an examinee omits (fails to respond to) an item and therefore the item response formula cannot be used. Preliminary considerations are discussed, and it is shown that a conveniently simple application of equivalent items leads to internal…
Descriptors: Guessing (Tests), Latent Trait Theory, Mathematical Models, Maximum Likelihood Statistics
Cook, Linda L.; And Others – 1982
Data from the Scholastic Aptitude Test-Verbal (SAT-V), SAT Mathematics (SAT-M), and Achievement Tests in Biology, American History, and Social Studies were used for this study. The temporal stability of item parameter estimates obtained for the same set of items calibrated for different examinees at different times was analyzed. It was believed…
Descriptors: Achievement Tests, Aptitude Tests, Equated Scores, Item Analysis
Reckase, Mark D.; McKinley, Robert L. – 1984
The purpose of this paper is to present a generalization of the concept of item difficulty to test items that measure more than one dimension. Three common definitions of item difficulty were considered: the proportion of correct responses for a group of individuals; the probability of a correct response to an item for a specific person; and the…
Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models
Tsutakawa, Robert K.; Lin, Hsin Ying – 1984
Item response curves for a set of binary responses are studied from a Bayesian viewpoint of estimating the item parameters. For the two-parameter logistic model with normally distributed ability, restricted bivariate beta priors are used to illustrate the computation of the posterior mode via the EM algorithm. The procedure is illustrated by data…
Descriptors: Algorithms, Bayesian Statistics, College Entrance Examinations, Estimation (Mathematics)
Pages: 1  |  ...  |  376  |  377  |  378  |  379  |  380  |  381  |  382  |  383  |  384  |  ...  |  637