ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	11

Descriptor

Difficulty Level	20
Test Validity	20
Test Construction	12
Test Items	12
Test Reliability	9
Language Tests	5
Testing	5
English (Second Language)	4
Higher Education	4
Multiple Choice Tests	4
Reading Tests	4
Comparative Analysis	3
Foreign Countries	3
Item Analysis	3
Language Proficiency	3
Models	3
Scoring	3
Student Evaluation	3
Academic Standards	2
Computer Assisted Testing	2
Disabilities	2
Educational Research	2
Elementary School Students	2
Evaluation Methods	2
Grading	2
More ▼

Source

Behavioral Research and…	2
Practical Assessment,…	2
American Journal of…	1
Communique	1
English Teaching Forum	1
Focus	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of Chemical Education	1
Language Testing	1
National Foundation for…	1
Online Submission	1
Thought Currents in English…	1
More ▼

Publication Type

Reports - Descriptive	20
Journal Articles	10
Speeches/Meeting Papers	3
Numerical/Quantitative Data	2
Tests/Questionnaires	2
Collected Works - Serials	1

Education Level

Elementary Education	3
Grade 3	2
Grade 4	2
Grade 5	2
Secondary Education	2
Grade 1	1
Grade 2	1
Grade 6	1
Grade 7	1
Grade 8	1
Higher Education	1
Kindergarten	1
Middle Schools	1
Postsecondary Education	1
More ▼

Audience

Location

Japan	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

Linear Logistic Test Modeling with R

Peer reviewed
PDF on ERIC

Download full text

Baghaei, Purya; Kubinger, Klaus D. – Practical Assessment, Research & Evaluation, 2015

The present paper gives a general introduction to the linear logistic test model (Fischer, 1973), an extension of the Rasch model with linear constraints on item parameters, along with eRm (an R package to estimate different types of Rasch models; Mair, Hatzinger, & Mair, 2014) functions to estimate the model and interpret its parameters. The…

Descriptors: Item Response Theory, Models, Test Validity, Hypothesis Testing

Use of Evidence-Centered Design to Develop Learning Maps-Based Assessments

Peer reviewed

Direct link

Sue Bechard; Amy Clark; Russell Swinburne Romine; Meagan Karvonen; Neal Kingston; Karen Erickson – International Journal of Testing, 2019

Evidence-based approaches to assessment design, development, and administration provide a strong foundation for an assessment's validity argument but can be time consuming, resource intensive, and complex to implement. This article describes an evidence-based approach used for one assessment that addresses these challenges. Evidence-centered…

Descriptors: Evidence Based Practice, Test Construction, Test Validity, Measurement

Assessment Engineering Task Model Maps, Task Models and Templates as a New Way to Develop and Implement Test Specifications

Peer reviewed

Direct link

Luecht, Richard M. – Journal of Applied Testing Technology, 2013

Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…

Descriptors: Engineering, Test Construction, Test Items, Models

Computer-Adaptive Assessments: Fundamentals and Considerations

Direct link

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

The Development of Multiple-Choice Items Consistent with the AP Chemistry Curriculum Framework to More Accurately Assess Deeper Understanding

Peer reviewed

Direct link

Domyancich, John M. – Journal of Chemical Education, 2014

Multiple-choice questions are an important part of large-scale summative assessments, such as the advanced placement (AP) chemistry exam. However, past AP chemistry exam items often lacked the ability to test conceptual understanding and higher-order cognitive skills. The redesigned AP chemistry exam shows a distinctive shift in item types toward…

Descriptors: Multiple Choice Tests, Science Instruction, Chemistry, Summative Evaluation

GCSE and A Level Reform: Are the New Qualifications Returning a "Gold Standard" of Assessment? Election Factsheet

Download full text

Burdett, Newman – National Foundation for Educational Research, 2015

This election factsheet highlights the following points: (1) While the GCSE pass rate has increased since its introduction, this doesn't tell us very much about how standards have changed. Evidence from international surveys suggests that education standards have remained stable. Stopping the use of modules and limiting resits is likely to reduce…

Descriptors: Secondary Education, Academic Standards, Educational Change, Achievement Gains

Twenty Common Testing Mistakes for EFL Teachers to Avoid

Download full text

Henning, Grant – English Teaching Forum, 2012

To some extent, good testing procedure, like good language use, can be achieved through avoidance of errors. Almost any language-instruction program requires the preparation and administration of tests, and it is only to the extent that certain common testing mistakes have been avoided that such tests can be said to be worthwhile selection,…

Descriptors: Testing, English (Second Language), Testing Problems, Student Evaluation

Linear Model to Assess the Scale's Validity of a Test

Download full text

Tristan, Agustin; Vidal, Rafael – Online Submission, 2007

Wright and Stone had proposed three features to assess the quality of the distribution of the items difficulties in a test, on the so called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing an evidence of the "scale…

Descriptors: Test Validity, Models, Difficulty Level, Test Items

Multiple Constructs and Effects of Accommodations on Accommodated Test Scores for Students with Disabilities

Peer reviewed

Direct link

Cawthon, Stephanie W.; Ho, Eching; Patel, Puja G.; Potvin, Deborah C.; Trundt, Katherine M. – Practical Assessment, Research & Evaluation, 2009

Students with disabilities frequently use accommodations to participate in large-scale, standardized assessments. Accommodations can include changes to the administration of the test, such as extended time, changes to the test items, such as read aloud, or changes to the student's response, such as the use of a scribe. Some accommodations or…

Descriptors: Test Items, Student Evaluation, Test Validity, Student Characteristics

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Download full text

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

A Vocabulary-Size Test of Controlled Productive Ability.

Peer reviewed

Laufer, Batia; Nation, Paul – Language Testing, 1999

Investigated the reliability, validity, and practicality of a controlled production measure of vocabulary, consisting of items from five frequency levels and using a completion-item format. Two equivalent test forms were compared. The test was found to be useful in distinguishing between different proficiency groups. (Author/MSE)

Descriptors: Difficulty Level, Language Tests, Second Languages, Test Construction

Examining the Technical Adequacy of Reading Comprehension Measures in a Progress Monitoring Assessment System. Technical Report # 41

Download full text

Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2007

In this technical report, the authors describe the development and piloting of reading comprehension measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fifth grade. They begin with a brief overview of the two conceptual frameworks underlying the…

Descriptors: Reading Comprehension, Emergent Literacy, Test Construction, Literacy Education

The Development of a Standardized Competency Examination for Doctor of Pharmacy Students.

Peer reviewed

Pray, W. Stephen; Popovich, Nicholas G. – American Journal of Pharmaceutical Education, 1985

Test development included designing, screening, and field testing of test items; compilation into an examination administered to a target group; and norm development for score comparison with a national sample. (MSE)

Descriptors: Difficulty Level, Doctoral Programs, Higher Education, Item Analysis

Advanced Russian Listening and Reading Proficiency Test. Final Project Report--Year 2.

Download full text

Educational Testing Service, Princeton, NJ. – 1986

The final project report on development of an advanced Russian language listening and reading proficiency test is presented. It summarizes activities in the second year of the project, including dissemination of summer 1985 test validation results to participating higher education institutions, item analyses, completion of the final test edition,…

Descriptors: Advanced Courses, Difficulty Level, Higher Education, Language Proficiency

Writing Good Tests for Student Grading or Research Purposes: Some Basic Precepts and Principles.

Download full text

Dodds, Jeffrey – 1999

Basic precepts for test development are described and explained as they are presented in measurement textbooks commonly used in the fields of education and psychology. The five building blocks discussed as the foundation of well-constructed tests are: (1) specification of purpose; (2) standard conditions; (3) consistency; (4) validity; and (5)…

Descriptors: Difficulty Level, Educational Research, Grading, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Liu, Kimy	2
Tindal, Gerald	2
Alonzo, Julie	1
Amy Clark	1
Arth, Thomas O.	1
Baghaei, Purya	1
Benderson, Albert, Ed.	1
Burdett, Newman	1
Cawthon, Stephanie W.	1
Dodds, Jeffrey	1
Domyancich, John M.	1
Henning, Grant	1
Herzog, Martha	1
Ho, Eching	1
Hofmeister, Alan M.	1
Jung, Eunju	1
Karen Erickson	1
Ketterlin-Geller, Leanne R.	1
Kubinger, Klaus D.	1
Laufer, Batia	1
Luecht, Richard M.	1
Meagan Karvonen	1
Mitchell, Alison M.	1
Nation, Paul	1
Neal Kingston	1
More ▼