Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 11 |
Descriptor
Difficulty Level | 20 |
Test Validity | 20 |
Test Construction | 12 |
Test Items | 12 |
Test Reliability | 9 |
Language Tests | 5 |
Testing | 5 |
English (Second Language) | 4 |
Higher Education | 4 |
Multiple Choice Tests | 4 |
Reading Tests | 4 |
More ▼ |
Source
Author
Liu, Kimy | 2 |
Tindal, Gerald | 2 |
Alonzo, Julie | 1 |
Amy Clark | 1 |
Arth, Thomas O. | 1 |
Baghaei, Purya | 1 |
Benderson, Albert, Ed. | 1 |
Burdett, Newman | 1 |
Cawthon, Stephanie W. | 1 |
Dodds, Jeffrey | 1 |
Domyancich, John M. | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 20 |
Journal Articles | 10 |
Speeches/Meeting Papers | 3 |
Numerical/Quantitative Data | 2 |
Tests/Questionnaires | 2 |
Collected Works - Serials | 1 |
Education Level
Elementary Education | 3 |
Grade 3 | 2 |
Grade 4 | 2 |
Grade 5 | 2 |
Secondary Education | 2 |
Grade 1 | 1 |
Grade 2 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
Higher Education | 1 |
More ▼ |
Audience
Location
Japan | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 1 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Baghaei, Purya; Kubinger, Klaus D. – Practical Assessment, Research & Evaluation, 2015
The present paper gives a general introduction to the linear logistic test model (Fischer, 1973), an extension of the Rasch model with linear constraints on item parameters, along with eRm (an R package to estimate different types of Rasch models; Mair, Hatzinger, & Mair, 2014) functions to estimate the model and interpret its parameters. The…
Descriptors: Item Response Theory, Models, Test Validity, Hypothesis Testing
Sue Bechard; Amy Clark; Russell Swinburne Romine; Meagan Karvonen; Neal Kingston; Karen Erickson – International Journal of Testing, 2019
Evidence-based approaches to assessment design, development, and administration provide a strong foundation for an assessment's validity argument but can be time consuming, resource intensive, and complex to implement. This article describes an evidence-based approach used for one assessment that addresses these challenges. Evidence-centered…
Descriptors: Evidence Based Practice, Test Construction, Test Validity, Measurement
Luecht, Richard M. – Journal of Applied Testing Technology, 2013
Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…
Descriptors: Engineering, Test Construction, Test Items, Models
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Domyancich, John M. – Journal of Chemical Education, 2014
Multiple-choice questions are an important part of large-scale summative assessments, such as the advanced placement (AP) chemistry exam. However, past AP chemistry exam items often lacked the ability to test conceptual understanding and higher-order cognitive skills. The redesigned AP chemistry exam shows a distinctive shift in item types toward…
Descriptors: Multiple Choice Tests, Science Instruction, Chemistry, Summative Evaluation
Burdett, Newman – National Foundation for Educational Research, 2015
This election factsheet highlights the following points: (1) While the GCSE pass rate has increased since its introduction, this doesn't tell us very much about how standards have changed. Evidence from international surveys suggests that education standards have remained stable. Stopping the use of modules and limiting resits is likely to reduce…
Descriptors: Secondary Education, Academic Standards, Educational Change, Achievement Gains
Henning, Grant – English Teaching Forum, 2012
To some extent, good testing procedure, like good language use, can be achieved through avoidance of errors. Almost any language-instruction program requires the preparation and administration of tests, and it is only to the extent that certain common testing mistakes have been avoided that such tests can be said to be worthwhile selection,…
Descriptors: Testing, English (Second Language), Testing Problems, Student Evaluation
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone had proposed three features to assess the quality of the distribution of the items difficulties in a test, on the so called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing an evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
Cawthon, Stephanie W.; Ho, Eching; Patel, Puja G.; Potvin, Deborah C.; Trundt, Katherine M. – Practical Assessment, Research & Evaluation, 2009
Students with disabilities frequently use accommodations to participate in large-scale, standardized assessments. Accommodations can include changes to the administration of the test, such as extended time, changes to the test items, such as read aloud, or changes to the student's response, such as the use of a scribe. Some accommodations or…
Descriptors: Test Items, Student Evaluation, Test Validity, Student Characteristics
Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…
Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Laufer, Batia; Nation, Paul – Language Testing, 1999
Investigated the reliability, validity, and practicality of a controlled production measure of vocabulary, consisting of items from five frequency levels and using a completion-item format. Two equivalent test forms were compared. The test was found to be useful in distinguishing between different proficiency groups. (Author/MSE)
Descriptors: Difficulty Level, Language Tests, Second Languages, Test Construction
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2007
In this technical report, the authors describe the development and piloting of reading comprehension measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fifth grade. They begin with a brief overview of the two conceptual frameworks underlying the…
Descriptors: Reading Comprehension, Emergent Literacy, Test Construction, Literacy Education

Pray, W. Stephen; Popovich, Nicholas G. – American Journal of Pharmaceutical Education, 1985
Test development included designing, screening, and field testing of test items; compilation into an examination administered to a target group; and norm development for score comparison with a national sample. (MSE)
Descriptors: Difficulty Level, Doctoral Programs, Higher Education, Item Analysis
Educational Testing Service, Princeton, NJ. – 1986
The final project report on development of an advanced Russian language listening and reading proficiency test is presented. It summarizes activities in the second year of the project, including dissemination of summer 1985 test validation results to participating higher education institutions, item analyses, completion of the final test edition,…
Descriptors: Advanced Courses, Difficulty Level, Higher Education, Language Proficiency
Dodds, Jeffrey – 1999
Basic precepts for test development are described and explained as they are presented in measurement textbooks commonly used in the fields of education and psychology. The five building blocks discussed as the foundation of well-constructed tests are: (1) specification of purpose; (2) standard conditions; (3) consistency; (4) validity; and (5)…
Descriptors: Difficulty Level, Educational Research, Grading, Higher Education
Previous Page | Next Page ยป
Pages: 1 | 2