Showing 6,316 to 6,330 of 9,530 results
Peer reviewed
Dempster, Edith R.; Reddy, Vijay – Science Education, 2007
This study investigated the relationship between readability of 73 text-only multiple-choice questions from Trends in International Mathematics and Science Study (TIMSS) 2003 and performance of two groups of South African learners: those with limited English-language proficiency (learners attending African schools) and those with better…
Descriptors: Instructional Effectiveness, Foreign Countries, Disadvantaged Youth, Sentences
Chyn, Susan; And Others – 1995
The current study, carried out jointly by Test Development and Statistical Analysis staff at Educational Testing Service, investigated the feasibility of the Automated Item Selection (AIS) procedure for the Test of English as a Foreign Language (TOEFL). Item-response theory (IRT)-based statistical specifications were developed. Two TOEFL test forms…
Descriptors: English (Second Language), Item Banks, Item Response Theory, Language Tests
Mislevy, Robert J.; Wilson, Mark – 1992
Standard item response theory (IRT) models posit latent variables to account for regularities in students' performance on test items. They can accommodate learning only if the expected changes in performance are smooth, and, in an appropriate metric, uniform over items. Wilson's "Saltus" model extends the ideas of IRT to development that…
Descriptors: Bayesian Statistics, Change, Development, Item Response Theory
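For orientation, the standard IRT models that the Mislevy and Wilson entry extends can be sketched with a minimal two-parameter logistic (2PL) response function. This is a generic illustration, not the Saltus model itself, and the parameter values below are arbitrary:

```python
import math

def irt_2pl(theta, a, b):
    """Probability of a correct response under a 2PL IRT model:
    P(theta) = 1 / (1 + exp(-a * (theta - b))),
    where theta is ability, b is item difficulty, and a is discrimination."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# When ability equals item difficulty (theta == b), P is exactly 0.5.
p = irt_2pl(theta=0.0, a=1.2, b=0.0)
```

The "latent variable" in the abstract is theta; regular IRT assumes this smooth, monotone relation between theta and performance holds uniformly across items, which is exactly the assumption the Saltus extension relaxes for developmental stage shifts.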
Stocking, Martha L. – 1994
As adaptive testing moves toward operational implementation in large scale testing programs, where it is important that adaptive tests be as parallel as possible to existing linear tests, a number of practical issues arise. This paper concerns three such issues. First, optimum item pool size is difficult to determine in advance of pool…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Standards
Pomplun, Mark; And Others – 1992
This study evaluated the use of bivariate matching as a solution to the problem of studying differential item functioning (DIF) with formula scored tests. Using Scholastic Aptitude Test verbal data with large samples, both male/female and black/white group comparisons were investigated. Mantel-Haenszel (MH) delta-(D) DIF values and DIF category…
Descriptors: Blacks, Criteria, Females, Item Bias
van den Bergh, Huub; And Others – 1995
The term differential item functioning (DIF) refers to whether or not the same psychological constructs are measured across different groups. If an item does not measure the same skills or subskills in different populations, it is said to function differentially or to display item bias. A multilevel approach to DIF is proposed. In such a model,…
Descriptors: Cluster Analysis, Estimation (Mathematics), Identification, Item Bias
Hicks, Marilyn M. – 1988
Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…
Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis
Yang, Wen-Ling; Houang, Richard T. – 1996
The influence of anchor length on the accuracy of test equating was studied using Tucker's linear method and two Item-Response-Theory (IRT) based methods, focusing on whether equating accuracy improved with more anchor items, whether the anchor effect depended on the equating method used, and the adequacy of the inclusion of the guessing parameter…
Descriptors: Equated Scores, Estimation (Mathematics), Guessing (Tests), Item Response Theory
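The linear equating studied in the Yang and Houang entry can be illustrated with a simplified sketch that maps a score on form X to the form-Y scale by matching means and standard deviations. Tucker's method additionally adjusts these moments toward a synthetic population using the anchor-test covariances; that adjustment is omitted here, so this is a simplification, not the full Tucker procedure:

```python
import statistics

def linear_equate(x, scores_x, scores_y):
    """Map score x on form X to the form-Y scale so that the two score
    distributions have equal mean and standard deviation:
    y = mu_Y + (sigma_Y / sigma_X) * (x - mu_X)."""
    mx, my = statistics.mean(scores_x), statistics.mean(scores_y)
    sx, sy = statistics.pstdev(scores_x), statistics.pstdev(scores_y)
    return my + (sy / sx) * (x - mx)

# Form Y here is simply form X's scale doubled, so x = 25 maps to y = 50.
y = linear_equate(25, [10, 20, 30], [20, 40, 60])
```

Anchor length matters because, in methods like Tucker's, the anchor items supply the covariance information used to estimate these moments for groups that never took both forms.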
Frey, Sharon L. – 1996
The Mantel-Haenszel procedure (N. Mantel and W. Haenszel, 1959) and its extension to constructed response items, the Generalized Mantel-Haenszel (A. Agresti, 1990), compare performance of subgroups across different score groups to determine differential item functioning (DIF). At each level of comparison, or score group, the subgroups are…
Descriptors: Ability, Comparative Analysis, Constructed Response, Ethnic Groups
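The Mantel-Haenszel comparison the Frey entry describes pools one 2x2 table per matched score group into a common odds ratio, conventionally reported on the ETS delta scale (the MH D-DIF statistic also mentioned in the Pomplun entry above). A minimal sketch for dichotomous items:

```python
import math

def mh_d_dif(strata):
    """Mantel-Haenszel common odds ratio over matched score groups,
    converted to the ETS delta scale: MH D-DIF = -2.35 * ln(alpha_MH).
    Each stratum is (ref_correct, ref_incorrect, focal_correct, focal_incorrect).
    Negative values indicate the item is harder for the focal group."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    alpha = num / den
    return -2.35 * math.log(alpha)

# Identical performance in both groups at every score level: alpha = 1,
# so MH D-DIF = 0 (no DIF).
d = mh_d_dif([(40, 10, 40, 10), (25, 25, 25, 25)])
```

Conditioning on score group is what distinguishes DIF from a simple group difference: the comparison is between examinees of equal matched ability, which is also why the choice of matching variable (as in the bivariate-matching study above) matters.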
Johanson, George A.; Doston, Glenn – 1994
Analyses of questionnaire data from a program evaluation indicate that the two dichotomous items "Would you recommend this to a friend?" and "Would you choose to do this again?" are not as interchangeable as might be expected from the survey literature. As part of the evaluation of a university program, a survey of graduates…
Descriptors: College Graduates, Data Analysis, Graduate Surveys, Higher Education
Plake, Barbara S.; Impara, James C. – 1996
This study investigated the intrajudge consistency of Angoff-based item performance estimates. The examination used was a certification examination in an emergency medicine specialty. Ten expert panelists rated the same 24 items twice during an operational standard setting study. Results indicate that the panelists were highly consistent, in terms…
Descriptors: Cutting Scores, Interrater Reliability, Licensing Examinations (Professions), Performance Based Assessment
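The Angoff-based item performance estimates in the Plake and Impara entry feed a standard-setting computation that can be sketched simply: each panelist estimates the probability that a minimally competent candidate answers each item correctly, and the cut score is the sum of the per-item mean ratings. This is the basic Angoff aggregation, not the specific operational procedure in the study:

```python
def angoff_cut_score(ratings):
    """ratings[j][i] is panelist j's estimated probability that a minimally
    competent candidate answers item i correctly. The Angoff cut score is
    the expected raw score: the sum over items of the mean panelist rating."""
    n_items = len(ratings[0])
    item_means = [sum(r[i] for r in ratings) / len(ratings)
                  for i in range(n_items)]
    return sum(item_means)

# Two panelists, two items: item means 0.6 and 0.7, cut score 1.3 of 2.
cut = angoff_cut_score([[0.5, 0.8], [0.7, 0.6]])
```

Intrajudge consistency, the focus of the study, asks whether a panelist's two ratings of the same item agree, since instability at the item level propagates directly into this sum.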
McKinley, Robert – 1989
A confirmatory approach to assessing test structure using multidimensional item response theory (MIRT) was developed and evaluated. The approach involved adding to the exponent of the MIRT model an item structure matrix that allows the user to specify the ability dimensions measured by an item. Various combinations of item structures were fit to…
Descriptors: Ability, Chi Square, Goodness of Fit, Item Response Theory
Roberts, James S.; Laughlin, James E. – 1996
Binary or graded disagree-agree responses to attitude items are often collected for the purpose of attitude measurement. Although such data are sometimes analyzed with cumulative measurement models, recent investigations suggest that unfolding models are more appropriate (J. S. Roberts, 1995; W. H. Van Schuur and H. A. L. Kiers, 1994). Advances in…
Descriptors: Attitude Measures, Estimation (Mathematics), Item Response Theory, Mathematical Models
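The cumulative-versus-unfolding distinction in the Roberts and Laughlin entry comes down to the shape of the item response function: cumulative models assume agreement rises monotonically with the latent attitude, while unfolding (ideal-point) models assume agreement peaks where the person's position is closest to the item's. The toy functions below only illustrate that shape difference; operational unfolding models such as Roberts's graded unfolding model are more elaborate:

```python
import math

def cumulative_p(theta, b):
    """Cumulative (monotone) model: agreement keeps rising with theta."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def unfolding_p(theta, delta):
    """Toy single-peaked (unfolding) response: agreement is highest when
    the person's position theta coincides with the item location delta,
    and falls off in both directions. Illustrative only."""
    return math.exp(-0.5 * (theta - delta) ** 2)
```

Under an unfolding model, a respondent can disagree with a moderate statement from either side of it, which is why fitting a cumulative model to such disagree-agree data can be misleading.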
Wainer, Howard; And Others – 1991
It is sometimes sensible to think of the fundamental unit of test construction as being larger than an individual item. This unit, dubbed the testlet, must pass muster in the same way that items do. One criterion of a good item is the absence of differential item functioning (DIF). The item must function in the same way as all important…
Descriptors: Definitions, Identification, Item Bias, Item Response Theory
Crehan, Kevin D.; And Others – 1993
Among the measurement techniques receiving greater attention is the context-dependent item set or testlet. The context-dependent item set consists of a scenario and related test questions. This item format is generally believed to be able to tap higher level thinking. Unfortunately, this item form leads to inter-item dependence within item sets…
Descriptors: Comparative Analysis, Item Response Theory, Measurement Techniques, Reading Tests