ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	3

Descriptor

Difficulty Level	11
Test Construction	11
Test Use	11
Test Items	6
Achievement Tests	4
Test Interpretation	4
Academic Standards	3
Elementary Secondary Education	3
Foreign Countries	3
Item Analysis	3
Language Proficiency	3
Scoring	3
Test Content	3
Comparative Analysis	2
Equated Scores	2
Item Banks	2
Language Tests	2
Multiple Choice Tests	2
Objective Tests	2
Statistical Analysis	2
Student Evaluation	2
Test Results	2
Test Validity	2
Adult Students	1
Alignment (Education)	1
More ▼

Source

Educational Measurement:…	2
Educational Assessment	1
Ministerial Council on…	1

Author

Boeijen, Marijke	1
Crawford, Gary D.	1
Davidson, Fred	1
Dodds, Jeffrey	1
Donovan, Jenny	1
Frisbie, David A.	1
Hutton, Penny	1
Lennon, Melissa	1
Mathieu, Cindy K.	1
Ridgeway, Gretchen Freiheit	1
Sinharay, Sandip	1
Traynor, Anne	1
Wu, Margaret	1
More ▼

Publication Type

Reports - Descriptive	4
Reports - Evaluative	4
Speeches/Meeting Papers	4
Journal Articles	3
Reports - Research	2
Guides - Non-Classroom	1
Information Analyses	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
Elementary Education	1
Grade 6	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Alabama	1
Australia	1
Indiana	1
Kansas	1
Massachusetts	1
Michigan	1
Minnesota	1
Netherlands	1
New Jersey	1
Ohio	1
Oregon	1
Vermont	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

On the Choice of Anchor Tests in Equating

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

Does Test Item Performance Increase with Test-to-Standards Alignment?

Peer reviewed

Direct link

Traynor, Anne – Educational Assessment, 2017

Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…

Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum

Basic Precepts in Test Construction: Recommendations from Various Measurement Textbooks.

Download full text

Mathieu, Cindy K. – 1997

This paper presents six steps in test construction generally recommended by measurement textbook authors. The focus is primarily on paper-and-pencil achievement tests as used by class instructions, although the discussion touches on the construction of other types of assessment. The six steps are: (1) determine the test purpose; (2) determine the…

Descriptors: Achievement Tests, Difficulty Level, Measurement Techniques, Selection

Language Test Unidimensional Model Fit at Multiple Ability Levels.

Download full text

Davidson, Fred – 1995

This study examined initial evidence of changes in fit to a unidimensional model for some language tests at multiple ability levels. Seven data sets were analyzed using the first phase of exploratory factor analysis: principal component eigenvalue extraction. Each data set is analyzed at varying n-sizes: whole group; random subsample; and five…

Descriptors: Difficulty Level, Language Aptitude, Language Proficiency, Language Skills

Oral Language Proficiency Testing at the Foreign Service Institute. An Update--1983.

Crawford, Gary D.; And Others – 1983

The Foreign Service Institute (FSI) has been engaged in oral language proficiency testing theory and practice for more than 20 years. The FSI test has been consistent during this time in format, evaluation criteria, performance standards, and level definitions. Current concerns about the degree of standardization of the format and the strength of…

Descriptors: Academic Standards, Adult Students, Difficulty Level, Evaluation Criteria

Writing Good Tests for Student Grading or Research Purposes: Some Basic Precepts and Principles.

Download full text

Dodds, Jeffrey – 1999

Basic precepts for test development are described and explained as they are presented in measurement textbooks commonly used in the fields of education and psychology. The five building blocks discussed as the foundation of well-constructed tests are: (1) specification of purpose; (2) standard conditions; (3) consistency; (4) validity; and (5)…

Descriptors: Difficulty Level, Educational Research, Grading, Higher Education

The Multiple True-False Item Format: A Status Review.

Peer reviewed

Frisbie, David A. – Educational Measurement: Issues and Practice, 1992

Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)

Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests

Testing Listening Comprehension: Notions and Functions as "Discrete Points" in a Communicative Context.

Boeijen, Marijke – 1984

This study examines two kinds of French language tests originating from the Threshold-Level (T-Level) or Niveau-Seuil theory of language learning in the context of several linguistic and testing concepts: discrete-point testing with regard to notions and functions from T-Level theory, communicative context, and assessment of listening…

Descriptors: Communicative Competence (Languages), Criterion Referenced Tests, Difficulty Level, Foreign Countries

An Application of Latent Trait Test Methodology to a Large School District Testing Program.

Ridgeway, Gretchen Freiheit – 1982

A one-parameter latent trait model was the basis of the test development procedures in the Basic Skills Assessment Program (BSAP) of the Department of Defense Dependents Schools (DoDDS). Several issues are involved in applying the Rasch model to an assessment program in a large school district. Separate sets of skills continua are arranged by…

Descriptors: Achievement Tests, Basic Skills, Dependents Schools, Difficulty Level

Multiple-Choice Cloze Exercises: Handbook. SPPED Test Development Notebook, Form 86. Revised.

New York State Education Dept., Albany. Div. of Research. – 1977

This handbook is a guide to the "Test Development Notebook" of the System for Pupil and Program Evaluation and Development (SPPED). It describes the contents and organization of the notebook, suggests various uses of the exercises, provides sample test designs, outlines test production procedures, and offers guidelines for score…

Descriptors: Cloze Procedure, Consumer Education, Difficulty Level, Elementary Secondary Education

National Assessment Program--Science Literacy Year 6 Technical Report, 2006

Download full text

Wu, Margaret; Donovan, Jenny; Hutton, Penny; Lennon, Melissa – Ministerial Council on Education, Employment, Training and Youth Affairs (NJ1), 2008

In July 2001, the Ministerial Council on Education, Employment, Training and Youth Affairs (MCEETYA) agreed to the development of assessment instruments and key performance measures for reporting on student skills, knowledge and understandings in primary science. It directed the newly established Performance Measurement and Reporting Taskforce…

Descriptors: Foreign Countries, Scientific Literacy, Science Achievement, Comparative Analysis