Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Item Analysis | 11 |
Test Theory | 11 |
Testing Problems | 11 |
Test Items | 6 |
Comparative Analysis | 4 |
Test Construction | 4 |
Test Reliability | 4 |
Statistical Analysis | 3 |
Test Validity | 3 |
Testing | 3 |
Achievement Tests | 2 |
More ▼ |
Source
Assessment in Education:… | 1 |
Instructional Science | 1 |
Journal of Educational and… | 1 |
School Psychology Review | 1 |
Author
Altepeter, Tom | 1 |
Beguin, A. A. | 1 |
Bhaskar, R. | 1 |
Broussard, Rolland L. | 1 |
Chase, Clinton I. | 1 |
Choppin, Bruce | 1 |
Cohen, Andrew D. | 1 |
Dillard, Jesse F. | 1 |
Herman, Joan | 1 |
Jacobs, Lucy Cheser | 1 |
Longford, Nicholas T. | 1 |
More ▼ |
Publication Type
Reports - Research | 6 |
Journal Articles | 4 |
Reports - Descriptive | 2 |
Speeches/Meeting Papers | 2 |
Books | 1 |
Guides - Non-Classroom | 1 |
Opinion Papers | 1 |
Reports - Evaluative | 1 |
Education Level
Secondary Education | 1 |
Audience
Researchers | 2 |
Practitioners | 1 |
Teachers | 1 |
Location
Israel | 1 |
Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Expressive One Word Picture… | 1 |
What Works Clearinghouse Rating
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Choppin, Bruce; And Others – 1982
A detailed description of five latent structure models of achievement measurement is presented. The first project paper, by David L. McArthur, analyzes the history of mental testing to show how conventional item analysis procedures were developed, and how dissatisfaction with them has led to fragmentation. The range of distinct conceptual and…
Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Data Analysis
Myers, Charles T. – 1978
The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

Bhaskar, R.; Dillard, Jesse F. – Instructional Science, 1983
Description of an objective method for assigning weights to questions on examinations includes discussions of classical test theory, knowledge organization, and how task analysis can be used to identify knowledge elements required to solve specific problems, rank them, and assign objective weights to exam questions using a Pareto distribution (7…
Descriptors: Accounting, Epistemology, Evaluation Methods, Item Analysis

Altepeter, Tom – School Psychology Review, 1983
A critical review of the Expressive One-Word Picture Vocabulary Test (Gardner) is offered. The reviewer feels that the instrument cannot be recommended in its present form. Further research concerning the manual, and theoretical issues, (particularly test-retest stability) is strongly recommended. (Author/PN)
Descriptors: Error of Measurement, Intelligence Tests, Item Analysis, Pictorial Stimuli
Webb, Noreen; Herman, Joan – 1984
This paper describes the development of a language arts test to assess the consistency of student response patterns and the feasibility of using the test to diagnose students' misconceptions. The studies were part of a project to develop computerized adaptive testing for the language arts with software to diagnose student errors. The…
Descriptors: Adaptive Testing, Computer Assisted Testing, Diagnostic Tests, Error Patterns
Cohen, Andrew D. – 1989
A study investigated the effects of specific guidelines in the taking and rating of tests of summarizing ability. The subjects were 63 native-Hebrew-speaking students enrolled in English-as-a-Second-Language (ESL) courses at the Seminar Hakibbutzim Teacher Training College in Tel Aviv (Israel). The subjects were given two sets of instructions…
Descriptors: Answer Keys, Comparative Analysis, English (Second Language), Foreign Countries
Broussard, Rolland L. – 1985
The cultural bias of the Adult Performance Level Assessment, Form AA-l (APLA) was examined. The potential influence of cultural differences on scores of a major ethnic group, Acadians or Cajuns, was investigated. Assessment items most prone to produce differences in scores were isolated and administered to selected groups. No significant…
Descriptors: Adult Basic Education, Adult Literacy, Culture Fair Tests, Ethnic Groups
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests
Jacobs, Lucy Cheser; Chase, Clinton I. – 1992
This book offers specific how-to advice to college faculty on every stage of the testing process, including planning the test and classifying objectives to be measured, ensuring the validity and reliability of the test, and grading in such a way as to arrive at fair grades based on relevant data. The book examines the strengths and weaknesses of…
Descriptors: Cheating, College Faculty, Comparative Analysis, Computer Assisted Testing