NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Peer reviewed Peer reviewed
Direct linkDirect link
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Choppin, Bruce; And Others – 1982
A detailed description of five latent structure models of achievement measurement is presented. The first project paper, by David L. McArthur, analyzes the history of mental testing to show how conventional item analysis procedures were developed, and how dissatisfaction with them has led to fragmentation. The range of distinct conceptual and…
Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Data Analysis
Myers, Charles T. – 1978
The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction
Peer reviewed Peer reviewed
Bhaskar, R.; Dillard, Jesse F. – Instructional Science, 1983
Description of an objective method for assigning weights to questions on examinations includes discussions of classical test theory, knowledge organization, and how task analysis can be used to identify knowledge elements required to solve specific problems, rank them, and assign objective weights to exam questions using a Pareto distribution (7…
Descriptors: Accounting, Epistemology, Evaluation Methods, Item Analysis
Peer reviewed Peer reviewed
Altepeter, Tom – School Psychology Review, 1983
A critical review of the Expressive One-Word Picture Vocabulary Test (Gardner) is offered. The reviewer feels that the instrument cannot be recommended in its present form. Further research concerning the manual, and theoretical issues, (particularly test-retest stability) is strongly recommended. (Author/PN)
Descriptors: Error of Measurement, Intelligence Tests, Item Analysis, Pictorial Stimuli
Webb, Noreen; Herman, Joan – 1984
This paper describes the development of a language arts test to assess the consistency of student response patterns and the feasibility of using the test to diagnose students' misconceptions. The studies were part of a project to develop computerized adaptive testing for the language arts with software to diagnose student errors. The…
Descriptors: Adaptive Testing, Computer Assisted Testing, Diagnostic Tests, Error Patterns
Cohen, Andrew D. – 1989
A study investigated the effects of specific guidelines in the taking and rating of tests of summarizing ability. The subjects were 63 native-Hebrew-speaking students enrolled in English-as-a-Second-Language (ESL) courses at the Seminar Hakibbutzim Teacher Training College in Tel Aviv (Israel). The subjects were given two sets of instructions…
Descriptors: Answer Keys, Comparative Analysis, English (Second Language), Foreign Countries
Broussard, Rolland L. – 1985
The cultural bias of the Adult Performance Level Assessment, Form AA-l (APLA) was examined. The potential influence of cultural differences on scores of a major ethnic group, Acadians or Cajuns, was investigated. Assessment items most prone to produce differences in scores were isolated and administered to selected groups. No significant…
Descriptors: Adult Basic Education, Adult Literacy, Culture Fair Tests, Ethnic Groups
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests
Jacobs, Lucy Cheser; Chase, Clinton I. – 1992
This book offers specific how-to advice to college faculty on every stage of the testing process, including planning the test and classifying objectives to be measured, ensuring the validity and reliability of the test, and grading in such a way as to arrive at fair grades based on relevant data. The book examines the strengths and weaknesses of…
Descriptors: Cheating, College Faculty, Comparative Analysis, Computer Assisted Testing