Publication Date
| Date Range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 45 |
| Since 2017 (last 10 years) | 91 |
| Since 2007 (last 20 years) | 144 |
Descriptor
| Descriptor | Count |
| --- | --- |
| Test Format | 418 |
| Test Reliability | 418 |
| Test Validity | 243 |
| Test Construction | 135 |
| Test Items | 119 |
| Higher Education | 88 |
| Multiple Choice Tests | 68 |
| Foreign Countries | 67 |
| Testing | 65 |
| Test Interpretation | 61 |
| Comparative Analysis | 57 |
Audience
| Audience | Count |
| --- | --- |
| Practitioners | 33 |
| Teachers | 23 |
| Administrators | 18 |
| Researchers | 12 |
| Community | 1 |
| Counselors | 1 |
| Policymakers | 1 |
| Students | 1 |
| Support Staff | 1 |
Location
| Location | Count |
| --- | --- |
| New York | 9 |
| Turkey | 8 |
| California | 7 |
| Canada | 6 |
| Japan | 6 |
| Germany | 4 |
| United Kingdom | 4 |
| Georgia | 3 |
| Israel | 3 |
| France | 2 |
| Indonesia | 2 |
Laws, Policies, & Programs
| Law, Policy, or Program | Count |
| --- | --- |
| Individuals with Disabilities… | 1 |
| Job Training Partnership Act… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Pell Grant Program | 1 |
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Macqueen, Susy; Harding, Luke – Language Testing, 2009
In 2002 the University of Cambridge Local Examinations Syndicate (UCLES) implemented a revised version of the Certificate of Proficiency in English (CPE). CPE, which is the highest level of the Main Suite of Cambridge ESOL exams, comprises five modules, "Reading," "Writing," "Use of English," "Listening" and "Speaking," the last of which is the…
Descriptors: Speech Communication, Test Reviews, Examiners, English (Second Language)
Alderson, J. Charles – Language Testing, 2009
In this article, the author reviews the TOEFL iBT, which is the latest version of the TOEFL, whose history stretches back to 1961. The TOEFL iBT was introduced in the USA, Canada, France, Germany and Italy in late 2005. Currently the TOEFL test is offered in two testing formats: (1) Internet-based testing (iBT); and (2) paper-based testing (PBT).…
Descriptors: Oral Language, Writing Tests, Listening Comprehension Tests, Test Reviews
New York State Education Department, 2014
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes used to develop and implement the NYSAA program, and stakeholder involvement in those processes. The purpose of this report is to document the technical aspects of the 2013-14 NYSAA.…
Descriptors: Alternative Assessment, Educational Assessment, State Departments of Education, Student Evaluation
Marshall, Robert C.; Wright, Heather Harris – American Journal of Speech-Language Pathology, 2007
Purpose: The Kentucky Aphasia Test (KAT) is an objective measure of language functioning for persons with aphasia. This article describes materials, administration, and scoring of the KAT; presents the rationale for development of test items; reports information from a pilot study; and discusses the role of the KAT in aphasia assessment. Method:…
Descriptors: Aphasia, Test Format, Language Tests, Expressive Language
Hoachlander, E. Gareth – Techniques: Making Education and Career Connections, 1998
Discusses state testing, various types of tests, and whether the increased attention to assessment is contributing to improved student learning. Describes uses of standardized multiple-choice, open-ended constructed response, essay, performance event, and portfolio methods. (JOW)
Descriptors: Academic Achievement, Student Evaluation, Test Format, Test Reliability
Peer reviewed: Streiner, David L.; Miller, Harold R. – Journal of Clinical Psychology, 1986
Numerous short forms of the Minnesota Multiphasic Personality Inventory have been proposed in the last 15 years. In each case, the initial enthusiasm has been replaced by questions about the clinical utility of the abbreviated version. Argues that the statistical properties of the test and reduced reliability due to shortening the scales…
Descriptors: Test Construction, Test Format, Test Length, Test Reliability
Peer reviewed: Benson, Philip G.; Dickinson, Terry L. – Educational and Psychological Measurement, 1983
The mixed standard scale is a rating format that allows researchers to count internally inconsistent response patterns. This study investigated the meaning of these counts, using 943 accountants as raters. The counts of internally inconsistent response patterns were not related to reliability as measured by Cronbach's alpha. (Author/BW)
Descriptors: Accountants, Adults, Error Patterns, Rating Scales
Peer reviewed: Torabi-Parizi, Rosa; Campbell, Noma Jo – Elementary School Journal, 1982
Investigates the effects of varying the placement of blanks and the number of options available in multiple-choice items on the reliability of fifth-grade students' scores. Results indicate that scores on three-choice item tests were not less reliable than scores on four-choice item tests. A similar result was found regarding the placement of…
Descriptors: Elementary Education, Elementary School Students, Scores, Test Format
Peer reviewed: Chambers, William V. – Social Behavior and Personality, 1985
Personal construct psychologists have suggested that various psychological functions explain differences in the stability of constructs. Among these functions are constellatory and loose construction. This paper argues that measurement error is a more parsimonious explanation of the differences in construct stability reported in these studies. (Author)
Descriptors: Error of Measurement, Test Construction, Test Format, Test Reliability
Peer reviewed: Grosse, Martin E.; Wright, Benjamin D. – Educational and Psychological Measurement, 1985
A model of examinee behavior was used to generate hypotheses about the operation of true-false scores. Confirmation of hypotheses supported the contention that true-false scores contain an error component that makes these tests less reliable than multiple-choice tests. Examinee response style may invalidate a total true-false score. (Author/DWH)
Descriptors: Objective Tests, Response Style (Tests), Test Format, Test Reliability
Peer reviewed: Wilcox, Rand R. – Educational and Psychological Measurement, 1982
Results in the engineering literature on "k out of n system reliability" can be used to characterize tests based on estimates of the probability of correctly determining whether the examinee knows the correct response. In particular, the minimum number of distractors required for multiple-choice tests can be empirically determined.…
Descriptors: Achievement Tests, Mathematical Models, Multiple Choice Tests, Test Format
Peer reviewed: Thompson, Martie; Kaslow, Nadine J.; Weiss, Bahr; Nolen-Hoeksema, Susan – Psychological Assessment, 1998
The psychometric properties of the Children's Attributional Style Questionnaire-Revised (CASQ) (N. Kaslow and S. Nolen-Hoeksema, 1991) were studied with 1,086 children, 9 to 12 years old. Results indicate that the revised version is somewhat less reliable than the original, but has equivalent criterion-related validity for self-reported depression.…
Descriptors: Attribution Theory, Concurrent Validity, Psychometrics, Racial Differences
Peer reviewed: Helfeldt, John P.; And Others – Journal of Educational Research, 1986
Performances of 64 sixth-grade readers on a traditional and three alternative types of cloze tests were compared. Results confirm and extend the findings of earlier studies investigating cloze alternatives. Advantages of the alternate forms are discussed. (Author/MT)
Descriptors: Cloze Procedure, Grade 6, Intermediate Grades, Reading Comprehension
Peer reviewed: Campbell, Todd; And Others – Educational and Psychological Measurement, 1997
The construct validity of scores from the Bem Sex-Role Inventory was studied using confirmatory factor analysis methods on data from 791 subjects. Measurement characteristics of the long and short forms were studied, with the short form yielding more reliable scores, as has previously been indicated. (Author/SLD)
Descriptors: Adults, Construct Validity, Factor Structure, Scores

