ERIC - Search Results

Publication Date

In 2026	0
Since 2025	5
Since 2022 (last 5 years)	45
Since 2017 (last 10 years)	91
Since 2007 (last 20 years)	144

Descriptor

Test Format	418
Test Reliability	418
Test Validity	243
Test Construction	135
Test Items	119
Higher Education	88
Multiple Choice Tests	68
Foreign Countries	67
Testing	65
Test Interpretation	61
Comparative Analysis	57
Language Tests	57
Computer Assisted Testing	55
Scores	53
Scoring	51
Student Evaluation	46
Psychometrics	44
Test Use	44
Standardized Tests	43
Elementary Secondary Education	40
Item Analysis	40
Test Content	40
College Students	36
Second Language Learning	36
Test Reviews	36
More ▼

Education Level

Higher Education	50
Postsecondary Education	42
Secondary Education	25
Elementary Education	24
Middle Schools	17
Junior High Schools	15
High Schools	10
Grade 8	9
Grade 7	8
Early Childhood Education	7
Elementary Secondary Education	7
Grade 3	7
Grade 5	7
Intermediate Grades	7
Grade 4	6
Grade 6	6
Primary Education	6
Adult Education	2
Kindergarten	2
Grade 1	1
Grade 9	1
Preschool Education	1
More ▼

Audience

Practitioners	33
Teachers	23
Administrators	18
Researchers	12
Community	1
Counselors	1
Policymakers	1
Students	1
Support Staff	1

Location

New York	9
Turkey	8
California	7
Canada	6
Japan	6
Germany	4
United Kingdom	4
Georgia	3
Israel	3
France	2
Indonesia	2
Iran	2
Netherlands	2
New York (New York)	2
Nigeria	2
Singapore	2
South Africa	2
United Kingdom (Great Britain)	2
Bangladesh	1
Brazil	1
China	1
Connecticut	1
Czech Republic	1
Estonia	1
Finland	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
Job Training Partnership Act…	1
No Child Left Behind Act 2001	1
Pell Grant Program	1

What Works Clearinghouse Rating

Test Reliability X

Showing 61 to 75 of 418 results Save | Export

Can Reliability of Multiple Component Measuring Instruments Depend on Response Option Presentation Mode?

Peer reviewed

Direct link

Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2016

This article examines the possible dependency of composite reliability on presentation format of the elements of a multi-item measuring instrument. Using empirical data and a recent method for interval estimation of group differences in reliability, we demonstrate that the reliability of an instrument need not be the same when polarity of the…

Descriptors: Test Reliability, Test Format, Test Items, Differences

Effects of Situational Judgment Test Format on Reliability and Validity

Peer reviewed

Direct link

Martin-Raugh, Michelle P.; Anguiano-Carrsaco, Cristina; Jackson, Teresa; Brenneman, Meghan W.; Carney, Lauren; Barnwell, Patrick; Kochert, Jonathan – International Journal of Testing, 2018

Single-response situational judgment tests (SRSJTs) differ from multiple-response SJTs (MRSJTS) in that they present test takers with edited critical incidents and simply ask test takers to read over the action described and evaluate it according to its effectiveness. Research comparing the reliability and validity of SRSJTs and MRSJTs is thus far…

Descriptors: Test Format, Test Reliability, Test Validity, Predictive Validity

Statistical Properties of the "GRE"® Psychology Test Subscores. ETS GRE® Board Research Report. ETS GRE®-18-02. ETS Research Report. RR-18-19

Peer reviewed
PDF on ERIC

Download full text

Liu, Yuming; Robin, Frédéric; Yoo, Hanwook; Manna, Venessa – ETS Research Report Series, 2018

The "GRE"® Psychology test is an achievement test that measures core knowledge in 12 content domains that represent the courses commonly offered at the undergraduate level. Currently, a total score and 2 subscores, experimental and social, are reported to test takers as well as graduate institutions. However, the American Psychological…

Descriptors: College Entrance Examinations, Graduate Study, Psychological Testing, Scores

New York State Testing Program: English Language Arts, Mathematics, and Elementary-Level (Grade 5) and Intermediate-Level (Grade 8) Science Field Tests. School Administrator's Manual for Computer-Based Field Testing. May 16-June 3, 2022. Grades 3-8

Download full text

New York State Education Department, 2022

The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Field Tests, and the Elementary-level (Grade 5) and Intermediate-level (Grade 8) Science Field Tests. School administrators must be thoroughly familiar with the…

Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing

A Review of the IELTS Test: Focus on Validity, Reliability, and Washback

Peer reviewed
PDF on ERIC

Download full text

Ali Hashemi; Samran Daneshfar – Indonesian Journal of English Language Teaching and Applied Linguistics, 2018

The International English Language Test System (IELTS) is one of the most reputable English tests that is used to assess the language proficiency of those who intend to study or work in an English speaking context. It is one of the most largescale proficiency tests which affects the lives of many students, as well as immigrants as the results of…

Descriptors: Test Validity, Test Reliability, Testing Problems, Language Proficiency

New York State Testing Program: English Language Arts and Mathematics Tests. School Administrator's Manual, v202, 2021. Grades 3-8

Download full text

New York State Education Department, 2021

The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written so that testing…

Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Direct link

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Reliability and Validity Evidence for the English and Spanish Preschool Narrative Language Measures-Listening

Peer reviewed
PDF on ERIC

Download full text

Direct link

Trina D. Spencer; Marilyn S. Thompson; Douglas B. Petersen; Yixing Liu; M. Adelaida Restrepo – Grantee Submission, 2023

For young Spanish-speaking children entering U. S. schools, it is imperative that educators foster growth in the home language and in the language of instruction to the fullest extent possible. Monitoring language development over time is crucial for promoting language development because it allows educators to individualize student instruction.…

Descriptors: Spanish Speaking, English (Second Language), Second Language Learning, Native Language

Same Test, Better Scores: Boosting the Reliability of Short Online Intelligence Recruitment Tests with Nested Logit Item Response Theory Models

Peer reviewed
PDF on ERIC

Download full text

Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019

Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…

Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability

Reliability and Structure of the TALIS Social Desirability Scale: An Assessment Based on Item Response Theory

Peer reviewed

Direct link

Kapuza, A. V.; Tyumeneva, Yu. A. – Russian Education & Society, 2017

One of the ways of controlling for the influence of social expectations on the answers given by survey respondents is to use a social desirability scale together with the main questions. The social desirability scale, which was included in the Teaching and Learning International Survey (TALIS) international comparative study for this purpose, was…

Descriptors: Surveys, Social Desirability, Measures (Individuals), Test Reliability

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Assessing Equivalency of PSC-17 Ratings: Does It Matter if Mixed or Grouped Item Format Is Used?

Peer reviewed

Direct link

DiStefano, Christine; Barth, Steven G.; Greer, Fred – Journal of Psychoeducational Assessment, 2019

This study investigated the effect of item position on descriptive statistics, psychometric information, and factor structure of the Pediatric Symptoms Checklist 17-item social-emotional screening instrument (PSC-17). The goal was to determine whether item position, either grouped by factor or mixed across constructs, produced similar results.…

Descriptors: Check Lists, Test Items, Factor Structure, Screening Tests

A Comparison of Two Content Area Curriculum-Based Measurement Tools

Peer reviewed

Direct link

Ford, Jeremy W.; Conoyer, Sarah J.; Lembke, Erica S.; Smith, R. Alex; Hosp, John L. – Assessment for Effective Intervention, 2018

In the present study, two types of curriculum-based measurement (CBM) tools in science, Vocabulary Matching (VM) and Statement Verification for Science (SV-S), a modified Sentence Verification Technique, were compared. Specifically, this study aimed to determine whether the format of information presented (i.e., SV-S vs. VM) produces differences…

Descriptors: Curriculum Based Assessment, Evaluation Methods, Measurement Techniques, Comparative Analysis

Tablets Instead of Paper-Based Tests for Young Children? Comparability between Paper and Tablet Versions of the Mathematical Heidelberger Rechen Test 1-4

Peer reviewed

Direct link

Hassler Hallstedt, Martin; Ghaderi, Ata – Educational Assessment, 2018

Tablets can be used to facilitate systematic testing of academic skills. Yet, when using validated paper tests on tablet, comparability between the mediums must be established. Comparability between a tablet and a paper version of a basic math skills test (HRT: Heidelberger Rechen Test 1-4) was investigated. Five samples with second and third…

Descriptors: Handheld Devices, Scores, Test Format, Computer Assisted Testing

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 28

Diagnostique	26
Educational and Psychological…	22
Journal of Educational…	9
Language Testing	9
New York State Education…	9
Psychological Assessment	7
Applied Psychological…	5
ETS Research Report Series	5
International Journal of…	5
Journal of Reading	5
Language Assessment Quarterly	5
Applied Measurement in…	4
Assessment	4
ProQuest LLC	4
Assessment & Evaluation in…	3
Assessment for Effective…	3
College Board	3
Evaluation and the Health…	3
Grantee Submission	3
Journal of Experimental…	3
Journal of Psychoeducational…	3
Perceptual and Motor Skills	3
Practical Assessment,…	3
Academic Medicine	2
Annual Review of Applied…	2
More ▼

White, Edward M.	6
Melancon, Janet G.	4
Thompson, Bruce	4
Trevisan, Michael S.	4
Federico, Pat-Anthony	3
Frisbie, David A.	3
Hambleton, Ronald K.	3
Sax, Gilbert	3
Stansfield, Charles W.	3
Straus, Murray A.	3
Aiken, Lewis R.	2
Alderson, J. Charles	2
Brown, James Dean	2
Bush, Martin	2
Conoyer, Sarah J.	2
Eignor, Daniel R.	2
Green, Kathy	2
Hamby, Sherry L.	2
Hendrickson, Amy	2
Henk, William A.	2
Henning, Grant	2
Kapes, Jerome T.	2
Liskin-Gasparro, Judith E.	2
Menold, Natalja	2
More ▼

Journal Articles	265
Reports - Research	239
Speeches/Meeting Papers	63
Reports - Descriptive	61
Reports - Evaluative	57
Information Analyses	25
Opinion Papers	24
Guides - Non-Classroom	21
Tests/Questionnaires	20
Guides - Classroom - Teacher	10
Guides - General	6
Numerical/Quantitative Data	5
Dissertations/Theses -…	4
Reference Materials -…	4
Books	3
ERIC Publications	1
Guides - Classroom - Learner	1
Non-Print Media	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	5
Embedded Figures Test	3
SAT (College Admission Test)	3
Wechsler Adult Intelligence…	3
Wechsler Intelligence Scale…	3
ACT Assessment	2
Beck Depression Inventory	2
Graduate Record Examinations	2
Minnesota Multiphasic…	2
Peabody Picture Vocabulary…	2
ACTFL Oral Proficiency…	1
Armed Services Vocational…	1
Attribution Style…	1
Behavior Assessment System…	1
Bem Sex Role Inventory	1
Bruininks Oseretsky Test of…	1
California Critical Thinking…	1
Canfield Learning Styles…	1
Computer Attitude Scale	1
Conflict Tactics Scale	1
Conners Rating Scales	1
Cornell Critical Thinking Test	1
Defining Issues Test	1
Developmental Indicators for…	1
Dimensions of Self Concept	1
More ▼