NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 6,586 to 6,600 of 9,547 results Save | Export
Close Up Foundation, Arlington, VA. – 1990
Designed for students, this survey of American history, culture, government, economics, and geography tests their knowledge in these areas through a variety of questions. The questions are organized into 12 subtopics divided among 4 major categories: 5 topics under History, 5 under Government, 1 under Economics, and 1 under Geography. The topic…
Descriptors: Citizenship Education, Economics Education, Geography Instruction, High Schools
Nandakumar, Ratna – 1994
By definition, differential item functioning (DIF) refers to unequal probabilities of a correct response to a test item by examinees from two groups when controlled for their ability differences. Simulation results are presented for an attempt to purify a test by separating out multidimensional items under the assumption that the intent of the…
Descriptors: Ability, Computer Simulation, Construct Validity, Educational Assessment
Fisher, William P., Jr. – 1991
In an address to the National Council on Measurement in Education, R. M. Jaeger (1987) commented that there appears to be a fundamental difference in measurement philosophy between those on the two sides of the debate over the Rasch model. Jaeger's observations are explicated by contrasting the views on measurement of B. D. Wright and E. F.…
Descriptors: Construct Validity, Content Validity, Educational Assessment, Item Response Theory
Powers, Donald E.; And Others – 1978
Much of the effort involved in a major restructuring of the Graduate Record Examinations (GRE) Aptitude Test was intended to result in the creation of an analytical module to supplement the verbal and quantitative sections of the test, thus providing broadened measurement. Factor extension analysis was used in the present study to investigate…
Descriptors: College Entrance Examinations, Factor Analysis, Factor Structure, Graduate Study
Bogan, Evelyn Doody; Yen, Wendy M. – 1983
Four multidimensional data configurations and one unidimensional data configuration were simulated for three differences in mean difficulty between two tests to be equated. Two chi-square statistics, Q1 and Q2, were examined for their ability to detect multidimensionality. Results indicated that Q1 did not discriminate between any of the…
Descriptors: Difficulty Level, Equated Scores, Goodness of Fit, Latent Trait Theory
Roth, Rod – 1983
College faculty (n=171) from 16 Arkansas colleges were asked to make validity and cut score judgments about the test items for the 1982 Arkansas National Teacher Examination (NTE) study of 23 area examinations. Each of the 23 data collection panels began with a training session which included specific directions for the estimates of the judges.…
Descriptors: College Faculty, Cutting Scores, Evaluation Criteria, Higher Education
Haladyna, Thomas M. – 1984
The purpose of this study is to examine an option-weighting method as it affects pass-fail decisions in formative and summative evaluation of student achievement for instructional units, certification, advancement, licensure, admissions, placement, and selection. A database was constructed using high school achievement test data where a…
Descriptors: Achievement Tests, Cutting Scores, High Schools, Multiple Choice Tests
Robinson, Rhonda S. – 1984
Turner's (1980) visual literacy test for high school students and adults was adapted for use with eighth grade students. The new version was limited to questions dealing with motion media, and a half-hour "M.A.S.H." narrative television program was chosen for the focus on television production techniques and the narrative elements of the…
Descriptors: Difficulty Level, Formative Evaluation, Research Methodology, Secondary School Students
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Robin, Frédéric; van der Linden, Wim J.; Eignor, Daniel R.; Steffen, Manfred; Stocking, Martha L. – ETS Research Report Series, 2005
The relatively new shadow test approach (STA) to computerized adaptive testing (CAT) proposed by Wim van der Linden is a potentially attractive alternative to the weighted deviation algorithm (WDA) implemented at ETS. However, it has not been evaluated under testing conditions representative of current ETS testing programs. Of interest was whether…
Descriptors: Test Construction, Computer Assisted Testing, Simulation, Evaluation Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006
This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…
Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias
Federal Aviation Administration (DOT), Washington, DC. – 1989
This question book was developed by the Federal Aviation Administration (FAA) to be used by FAA testing centers and FAA-designated written test examiners when administering the flight engineer written test. The book can be used to test applicants in the following flight engineer knowledge areas: basic, turbojet powered, turbopropeller powered, and…
Descriptors: Aircraft Pilots, Aviation Education, Aviation Technology, Aviation Vocabulary
Fisk, Yvette Hester – 1991
The reasons for recent endeavors to evaluate item bias are discussed, and item bias is defined. Some of the literature regarding the most promising methods of detecting item bias is reviewed. Three classes of methods for detecting item bias are discussed using concrete examples and illustrations. These methods are: (1) latent trait; (2)…
Descriptors: Chi Square, Comparative Analysis, Difficulty Level, Item Bias
Samejima, Fumiko – 1990
A method is proposed that increases the accuracies of estimation of the operating characteristics of discrete item responses, especially when the true operating characteristic is represented by a steep curve, and also at the lower and upper ends of the ability distribution where the estimation tends to be inaccurate because of the smaller number…
Descriptors: Ability Identification, Adaptive Testing, Computer Assisted Testing, Equations (Mathematics)
Rubin, Donald L. – 1989
Because the language of a multiple choice test is formal and often unfamiliar, certain linguistic features may lead a test-taker to misconstrue the test instructions, questions, or answers. When this happens, a shared understanding of meaning between tester and test-taker is not present, and the test results are invalid. Although this problem…
Descriptors: Black Dialects, Dialect Studies, English, Item Bias
Tollefson, Nona; Chen, Ju Shan – 1986
This study compared item difficulty and item discrimination indices for parallel multiple-choice items in three content areas: measurement concepts, statistical terminology, and synonyms. The statistics and measurement items were administered in classes where graduate students taking the test were studying the content. Vocabulary items represented…
Descriptors: Difficulty Level, Graduate Students, Higher Education, Item Analysis
Pages: 1  |  ...  |  436  |  437  |  438  |  439  |  440  |  441  |  442  |  443  |  444  |  ...  |  637