ERIC - Search Results

Showing 6,061 to 6,075 of 9,533 results

Constructed Response and Differential Item Functioning: A Pragmatic Approach.

Dorans, Neil J.; Schmitt, Alicia P. – 1991

Differential item functioning (DIF) assessment attempts to identify items or item types for which subpopulations of examinees exhibit performance differentials that are not consistent with the performance differentials typically seen for those subpopulations on collections of items that purport to measure a common construct. DIF assessment…

Descriptors: Computer Assisted Testing, Constructed Response, Educational Assessment, Item Bias

Reliability of the Test of Spoken English Revisited. Research Reports, Report 40.

Boldt, R. F. – 1992

The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…

Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency

The Three Stage Development of the NTE School Psychologist Specialty Area Test from a Job Analysis Perspective.

DeMauro, Gerald E. – 1990

Three papers describe the three stages of developing the National Teacher Examination (NTE) School Psychologist Specialty Area Test. The first stage is described in the paper entitled "Knowledge Areas Important to School Psychology." A survey of the membership of the National Association of School Psychologists helped determine knowledge…

Descriptors: Certification, Elementary Secondary Education, Job Analysis, Job Skills

An Analytical Evaluation of Two Common-Odds Ratios as Population Indicators of DIF.

Pommerich, Mary; And Others – 1995

The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…

Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Bias

The Central Role of Content Representation in Test Validity.

Sireci, Stephen G. – 1995

The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…

Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis

The Interplay of Evidence and Consequences in the Validation of Performance Assessments. Research Report.

Messick, Samuel – 1992

Authentic and direct assessments of performances and products are conceptualized in terms of multiple distinctions having implications for validation. These include contrasts between performances and products, between assessment of performance per se and performance assessment of competence or other constructs, between structured and unstructured…

Descriptors: Cognitive Processes, Competence, Educational Assessment, Evaluation Methods

How Unidimensional Are Tests Comprising Both Multiple-Choice and Free-Response Items? An Analysis of Two Tests. Program Statistics Research Technical Report No. 93-32.

Wainer, Howard; And Others – 1993

The relationship between the multiple-choice and free-response sections of the Computer Science and Chemistry tests of the College Board's Advanced Placement program was studied. Confirmatory factor analysis showed that the free-response sections measure the same underlying proficiency as the multiple-choice sections for the most part. However,…

Descriptors: Advanced Placement, Chemistry, Computer Science, High School Students

Using Rasch To Create Measures from Survey Data (or Making a Silk Purse out of a Sow's Ear).

Bode, Rita K. – 1995

This study describes the creation of measures of teachers' use of ability grouping in instruction using Rasch analysis. The dimensionality of the proposed construct was also investigated. Results of the Rasch analysis are compared to the results using composites to illustrate how the description of a construct can vary depending on the method used…

Descriptors: Ability Grouping, Classification, Educational Practices, Item Response Theory

Heresies of the New Unified Notion of Test Validity.

Stuck, Ivan – 1995

By focusing on "appropriateness" and "adequacy" of inference and action, unified validity may be misused in rejecting valid test outcomes. The notion of levels of validity is challenged, the necessity of assumption is argued, and experience is proposed as the basis of validity. "Consequential validity" is interpreted as an optional predictive…

Descriptors: Evaluation Methods, Measurement Techniques, Measures (Individuals), Predictive Validity

Stochastic Order in Dichotomous Item Response Models for Fixed Tests, Adaptive Tests, or Multiple Abilities. Research Report 95-02.

van der Linden, Wim J. – 1995

Dichotomous item response theory (IRT) models can be viewed as families of stochastically ordered distributions of responses to test items. This paper explores several properties of such distributions. The focus is on the conditions under which stochastic order in families of conditional distributions is transferred to their inverse distributions,…

Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Foreign Countries

How Contaminated by Guessing Are Item-Parameter Estimates and What Can Be Done about It?

Schnipke, Deborah L. – 1996

When running out of time on a multiple-choice test, some examinees are likely to respond rapidly to the remaining unanswered items in an attempt to get some items right by chance. Because these responses will tend to be incorrect, the presence of "rapid-guessing behavior" could cause these items to appear to be more difficult than they…

Descriptors: Difficulty Level, Estimation (Mathematics), Guessing (Tests), Item Response Theory

The Accuracy and Use of Item Difficulty Calibrations Estimated from Judges' Ratings of Item Difficulty.

Taube, Kurt T.; Newman, Larry S. – 1996

A method of estimating Rasch-model difficulty calibrations from judges' ratings of item difficulty is described. The ability of judges to estimate item difficulty was assessed by correlating estimated and empirical calibrations on each of four examinations offered by the American Association of State Social Work Boards. Thirteen members of the…

Descriptors: Correlation, Cutting Scores, Difficulty Level, Estimation (Mathematics)

Differential Item Functioning in Survey Research.

Johanson, George A.; Johanson, Susan N. – 1996

Differential item functioning (DIF), or item bias, occurs when individuals in a focal group respond differently to a test item than do individuals in a reference group even when comparisons are restricted to individuals with similar overall skill levels on the trait in question. It is common in constructing a questionnaire or survey to recommend…

Descriptors: Achievement Tests, Data Analysis, Evaluation Methods, Item Analysis

Francais langue seconde: trousse d'evaluation--tests modeles pour les cours: French 31a (Avance 7); French 31b (Avance 8); French 31c (Avance 9) (French as a Second Language: Evaluation Resource Package--Model Tests for the Courses: French 31a (Advanced 7); French 31b (Advanced 8); French 31c (Advanced 9)).

Alberta Dept. of Education, Edmonton. Language Services Branch. – 1995

The French as a Second Language model tests for advanced levels 7, 8, and 9 were designed to evaluate students' language performance, as outlined in the program of studies for Alberta, Canada, in listening and reading comprehension and oral and written production, communication skills, culture, language and general language knowledge. The tests…

Descriptors: Advanced Courses, Foreign Countries, French, Language Tests

The Effects of Content Mix and Equating Method on the Accuracy of Test Equating Using Anchor-Item Design.

Yang, Wen-Ling – 1997

Using an anchor-item design of test equating, the effects of three equating methods (Tucker linear and two three-parameter item-response-theory-based (3PL-IRT) methods), and the content representativeness of anchor items on the accuracy of equating were examined; and an innovative way of evaluating equating accuracy appropriate for the particular…

Descriptors: Equated Scores, Item Response Theory, Raw Scores, Test Construction
