Showing 6,076 to 6,090 of 9,554 results
Peer reviewed. Full-text PDF available on ERIC.
Huang, Yi-Min; Trevisan, Mike; Storfer, Andrew – International Journal for the Scholarship of Teaching and Learning, 2007
Despite the prevalence of multiple choice items in educational testing, there is a dearth of empirical evidence for multiple choice item writing rules. The purpose of this study was to expand the base of empirical evidence by examining the use of the "all-of-the-above" option in a multiple choice examination in order to assess how…
Descriptors: Multiple Choice Tests, Educational Testing, Ability Grouping, Test Format
Peer reviewed. Full-text PDF available on ERIC.
Timms, Michael; Schneider, Steven; Lee, Cindy; Rolfhus, Eric – Regional Educational Laboratory Southwest (NJ1), 2007
This policy research document is intended for Louisiana policymakers to use when examining possible changes to the state assessment's alignment with the National Assessment of Educational Progress (NAEP). The 2009 NAEP test is not yet in existence, so the purpose of this report is to give policymakers a head start in determining where they might,…
Descriptors: Federal Legislation, Test Items, Testing, Science Tests
Lecointe, Darius A. – 1995
The purpose of this Item Response Theory study was to investigate how the expected reduction in item information, due to the collapsing of response categories in performance assessment data, was affected by varying testing conditions: item difficulty, item discrimination, inter-rater reliability, and direction of collapsing. The investigation used…
Descriptors: Classification, Computer Simulation, Difficulty Level, Interrater Reliability
PDF pending restoration.
Kirisci, Levent; Hsu, Tse-Chi – 1995
The main goal of this study was to assess the sensitivity of unidimensional parameter estimates derived from BILOG when the unidimensionality assumption was violated and the underlying ability distribution was not multivariate normal. A multidimensional three-parameter logistic distribution that was a straightforward generalization of the…
Descriptors: Ability, Comparative Analysis, Correlation, Difficulty Level
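The three-parameter logistic (3PL) model that this abstract generalizes has a standard closed form. As a hedged illustration (not the authors' simulation code; the function name and the 1.7 scaling constant are conventional choices, not taken from the study), a minimal Python sketch:

```python
import math

def p_correct_3pl(theta: float, a: float, b: float, c: float) -> float:
    """Probability of a correct response under the unidimensional
    three-parameter logistic (3PL) IRT model, the base model that
    programs such as BILOG estimate.

    theta : examinee ability
    a     : item discrimination
    b     : item difficulty
    c     : pseudo-guessing lower asymptote
    """
    # 1.7 is the conventional scaling constant that makes the logistic
    # curve approximate the normal ogive.
    return c + (1.0 - c) / (1.0 + math.exp(-1.7 * a * (theta - b)))

# An examinee of average ability (theta = 0) on an item of matching
# difficulty (b = 0) lands midway between the guessing floor and 1.
print(p_correct_3pl(0.0, 1.2, 0.0, 0.2))  # about 0.6
```

The Rasch model referenced in the Bode entry further down is the special case a = 1, c = 0.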
Dorans, Neil J.; Schmitt, Alicia P. – 1991
Differential item functioning (DIF) assessment attempts to identify items or item types for which subpopulations of examinees exhibit performance differentials that are not consistent with the performance differentials typically seen for those subpopulations on collections of items that purport to measure a common construct. DIF assessment…
Descriptors: Computer Assisted Testing, Constructed Response, Educational Assessment, Item Bias
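The abstract defines DIF in general terms. One concrete index associated with Dorans's line of work, the standardization p-difference (STD P-DIF), is easy to sketch; this is an illustrative reconstruction, not code from the report, and the function and variable names are hypothetical:

```python
from collections import defaultdict

def std_p_dif(scores_f, correct_f, scores_r, correct_r):
    """Standardization p-difference for one studied item: the
    focal-group-weighted difference in proportion correct between the
    focal (f) and reference (r) groups, conditioned on total score.

    scores_*  : total test score for each examinee
    correct_* : 1 if the examinee answered the studied item correctly
    """
    def rates(scores, correct):
        n, right = defaultdict(int), defaultdict(int)
        for s, c in zip(scores, correct):
            n[s] += 1
            right[s] += c
        return {s: right[s] / n[s] for s in n}, n

    p_f, n_f = rates(scores_f, correct_f)
    p_r, _ = rates(scores_r, correct_r)
    total_f = sum(n_f.values())
    # Weight each score level by the focal group's share of examinees;
    # levels the reference group never reached contribute nothing.
    return sum((n_f[s] / total_f) * (p_f[s] - p_r[s])
               for s in p_f if s in p_r)
```

Values near zero indicate that, at matched score levels, the two groups perform about equally on the studied item.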
Boldt, R. F. – 1992
The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination, as described in its manual, refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency…
Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency
DeMauro, Gerald E. – 1990
Three papers describe the three stages of developing the National Teacher Examination (NTE) School Psychologist Specialty Area Test. The first stage is described in the paper entitled "Knowledge Areas Important to School Psychology." A survey of the membership of the National Association of School Psychologists helped determine knowledge…
Descriptors: Certification, Elementary Secondary Education, Job Analysis, Job Skills
Pommerich, Mary; And Others – 1995
The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…
Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Bias
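Since the abstract turns on how the MH statistic conditions on observed score, a minimal sketch of the Mantel-Haenszel common odds ratio may help. It is an assumption-laden illustration (2x2 counts per score level are supplied by the caller; the function names are hypothetical), not the authors' code:

```python
import math

def mh_odds_ratio(strata):
    """Mantel-Haenszel common odds ratio for a studied item.

    strata : iterable of (ref_right, ref_wrong, foc_right, foc_wrong)
             counts, one tuple per observed-score level. Conditioning
             on observed score is exactly the step the paper questions
             when the groups' ability distributions are incongruent.
    """
    num = den = 0.0
    for a, b, c, d in strata:
        t = a + b + c + d
        if t == 0:
            continue
        num += a * d / t
        den += b * c / t
    return num / den

def mh_d_dif(strata):
    """MH D-DIF on the ETS delta scale; values near 0 suggest little DIF."""
    return -2.35 * math.log(mh_odds_ratio(strata))
```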
Sireci, Stephen G. – 1995
The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…
Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis
Messick, Samuel – 1992
Authentic and direct assessments of performances and products are conceptualized in terms of multiple distinctions having implications for validation. These include contrasts between performances and products, between assessment of performance per se and performance assessment of competence or other constructs, between structured and unstructured…
Descriptors: Cognitive Processes, Competence, Educational Assessment, Evaluation Methods
Wainer, Howard; And Others – 1993
The relationship between the multiple-choice and free-response sections of the Computer Science and Chemistry tests of the College Board's Advanced Placement program was studied. Confirmatory factor analysis showed that, for the most part, the free-response sections measure the same underlying proficiency as the multiple-choice sections. However,…
Descriptors: Advanced Placement, Chemistry, Computer Science, High School Students
Bode, Rita K. – 1995
This study describes the use of Rasch analysis to create measures of teachers' use of ability grouping in instruction. The dimensionality of the proposed construct was also investigated. Results of the Rasch analysis are compared to the results using composites to illustrate how the description of a construct can vary depending on the method used…
Descriptors: Ability Grouping, Classification, Educational Practices, Item Response Theory
Stuck, Ivan – 1995
By focusing on "appropriateness" and "adequacy" of inference and action, unified validity may be misused in rejecting valid test outcomes. The notion of levels of validity is challenged, the necessity of assumption is argued, and experience is proposed as the basis of validity. "Consequential validity" is interpreted as an optional predictive…
Descriptors: Evaluation Methods, Measurement Techniques, Measures (Individuals), Predictive Validity
van der Linden, Wim J. – 1995
Dichotomous item response theory (IRT) models can be viewed as families of stochastically ordered distributions of responses to test items. This paper explores several properties of such distributions. The focus is on the conditions under which stochastic order in families of conditional distributions is transferred to their inverse distributions,…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Foreign Countries
Schnipke, Deborah L. – 1996
When running out of time on a multiple-choice test, some examinees are likely to respond rapidly to the remaining unanswered items in an attempt to get some items right by chance. Because these responses will tend to be incorrect, the presence of "rapid-guessing behavior" could cause these items to appear to be more difficult than they…
Descriptors: Difficulty Level, Estimation (Mathematics), Guessing (Tests), Item Response Theory
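Schnipke's rapid-guessing idea lends itself to a response-time illustration. The sketch below flags responses faster than a cutoff as probable guesses; the 3-second threshold and the function name are hypothetical stand-ins, since in practice thresholds are set per item from the response-time distribution:

```python
def flag_rapid_guesses(response_times, threshold_seconds=3.0):
    """Mark responses faster than the threshold as likely rapid guesses
    rather than solution behavior. Dropping or down-weighting flagged
    responses keeps end-of-test guessing from inflating the apparent
    difficulty of the final items.
    """
    return [rt < threshold_seconds for rt in response_times]

# A speeded examinee: deliberate early responses, rushed final ones.
times = [42.0, 37.5, 51.2, 2.1, 1.4, 0.9]
print(flag_rapid_guesses(times))
# [False, False, False, True, True, True]
```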