ERIC - Search Results

Publication Date

In 2026	0
Since 2025	215
Since 2022 (last 5 years)	1084
Since 2017 (last 10 years)	2594
Since 2007 (last 20 years)	4955

Descriptor

Test Items	9547
Test Construction	2723
Foreign Countries	2184
Item Response Theory	1872
Difficulty Level	1623
Item Analysis	1502
Test Validity	1416
Test Reliability	1187
Multiple Choice Tests	1158
Scores	1137
Computer Assisted Testing	1058
Comparative Analysis	1024
Test Format	956
Higher Education	877
Achievement Tests	855
Statistical Analysis	852
Mathematics Tests	845
Psychometrics	833
Test Bias	772
Models	754
Student Evaluation	736
Language Tests	699
Correlation	695
Evaluation Methods	674
Scoring	633
More ▼

Author

van der Linden, Wim J.	69
Tindal, Gerald	50
Hambleton, Ronald K.	45
Alonzo, Julie	41
Chang, Hua-Hua	40
Plake, Barbara S.	40
Sinharay, Sandip	37
Reckase, Mark D.	36
Wainer, Howard	33
Dorans, Neil J.	32
Gierl, Mark J.	30
Sireci, Stephen G.	28
Wang, Wen-Chung	26
Cohen, Allan S.	25
Meijer, Rob R.	25
Samejima, Fumiko	24
Stocking, Martha L.	24
Anderson, Daniel	23
Zwick, Rebecca	23
Veldkamp, Bernard P.	22
Haladyna, Thomas M.	21
Kim, Seock-Ho	21
Wise, Steven L.	21
Kim, Sooyeon	20
More ▼

Education Level

Higher Education	1314
Postsecondary Education	1064
Secondary Education	927
Elementary Education	716
Middle Schools	420
High Schools	363
Elementary Secondary Education	359
Junior High Schools	320
Grade 8	256
Intermediate Grades	209
Grade 4	183
Early Childhood Education	178
Grade 5	134
Primary Education	126
Grade 7	113
Grade 3	111
Grade 6	107
Grade 9	69
Grade 2	56
Grade 10	53
Grade 12	52
Kindergarten	50
Adult Education	39
Grade 11	38
Grade 1	36
More ▼

Audience

Practitioners	653
Teachers	563
Researchers	250
Students	201
Administrators	81
Policymakers	22
Parents	17
Counselors	8
Community	7
Support Staff	3
Media Staff	1
More ▼

Location

Turkey	226
Canada	223
Australia	155
Germany	116
United States	99
China	90
Florida	86
Indonesia	82
Taiwan	78
United Kingdom	73
California	66
Japan	65
Netherlands	64
Iran	62
United Kingdom (England)	57
South Africa	48
New York	46
Missouri	45
Oklahoma	44
South Korea	44
Malaysia	42
Texas	42
Sweden	38
Israel	37
Singapore	37
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4
Does not meet standards	1

Showing 6,331 to 6,345 of 9,547 results Save | Export

Three Practical Issues for Modern Adaptive Testing Item Pools.

Download full text

Stocking, Martha L. – 1994

As adaptive testing moves toward operational implementation in large scale testing programs, where it is important that adaptive tests be as parallel as possible to existing linear tests, a number of practical issues arise. This paper concerns three such issues. First, optimum item pool size is difficult to determine in advance of pool…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Standards

An Initial Evaluation of the Use of Bivariate Matching in DIF Analyses for Formula Scored Tests.

Download full text

Pomplun, Mark; And Others – 1992

This study evaluated the use of bivariate matching as a solution to the problem of studying differential item functioning (DIF) with formula scored tests. Using Scholastic Aptitude Test verbal data with large samples, both male/female and black/white group comparisons were investigated. Mantel-Haenszel (MH) delta-(D) DIF values and DIF category…

Descriptors: Blacks, Criteria, Females, Item Bias

Differential Item Functioning from a Multilevel Perspective.

Download full text

van den Bergh, Huub; And Others – 1995

The term differential item functioning (DIF) refers to whether or not the same psychological constructs are measured across different groups. If an item does not measure the same skills or subskills in different populations, it is said to function differentially or to display item bias. A multilevel approach to DIF is proposed. In such a model,…

Descriptors: Cluster Analysis, Estimation (Mathematics), Identification, Item Bias

Analyzing the Option Effects of Difficult TOEFL Items with Low Biserials: Methods Developed for Use by Test Assemblers.

Download full text

Hicks, Marilyn M. – 1988

Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…

Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis

The Effect of Anchor Length and Equating Method on the Accuracy of Test Equating: Comparisons of Linear and IRT-Based Equating Using an Anchor-Item Design.

Download full text

Yang, Wen-Ling; Houang, Richard T. – 1996

The influence of anchor length on the accuracy of test equating was studied using Tucker's linear method and two Item-Response-Theory (IRT) based methods, focusing on whether equating accuracy improved with more anchor items, whether the anchor effect depended on the equating method used, and the adequacy of the inclusion of the guessing parameter…

Descriptors: Equated Scores, Estimation (Mathematics), Guessing (Tests), Item Response Theory

Formulation of an Alternative Method of Determining Levels of Comparison for the Generalized Mantel-Haenszel Using IRT Ability Estimates.

Frey, Sharon L. – 1996

The Mantel-Haenszel procedure (N. Mantel and W. Haenszel, 1959) and its extension to constructed response items, the Generalized Mantel Haenszel (A. Agresti, 1990), compare performance of subgroups across different score groups to determine differential item functioning (DIF). At each level of comparison, or score group, the subgroups are…

Descriptors: Ability, Comparative Analysis, Constructed Response, Ethnic Groups

Single-Item Measurement: Would You Recommend It to a Friend?

Download full text

Johanson, George A.; Doston, Glenn – 1994

Analyses of questionnaire data from a program evaluation indicate that the two dichotomous items "Would you recommend this to a friend?" and "Would you choose to do this again?" are not as interchangeable as might be expected from the survey literature. As part of the evaluation of a university program, a survey of graduates…

Descriptors: College Graduates, Data Analysis, Graduate Surveys, Higher Education

Intrajudge Consistency Using the Angoff Standard-Setting Method.

Download full text

Plake, Barbara S.; Impara, James C. – 1996

This study investigated the intrajudge consistency of Angoff-based item performance estimates. The examination used was a certification examination in an emergency medicine specialty. Ten expert panelists rated the same 24 items twice during an operational standard setting study. Results indicate that the panelists were highly consistent, in terms…

Descriptors: Cutting Scores, Interrater Reliability, Licensing Examinations (Professions), Performance Based Assessment

Confirmatory Analysis of Test Structure Using Multidimensional Item Response Theory.

Download full text

McKinley, Robert – 1989

A confirmatory approach to assessing test structure using multidimensional item response theory (MIRT) was developed and evaluated. The approach involved adding to the exponent of the MIRT model an item structure matrix that allows the user to specify the ability dimensions measured by an item. Various combinations of item structures were fit to…

Descriptors: Ability, Chi Square, Goodness of Fit, Item Response Theory

The Graded Unfolding Model: A Unidimensional Item Response Model for Unfolding Graded Responses.

Download full text

Roberts, James S.; Laughlin, James E. – 1996

Binary or graded disagree-agree responses to attitude items are often collected for the purpose of attitude measurement. Although such data are sometimes analyzed with cumulative measurement models, recent investigations suggest that unfolding models are more appropriate (J. S. Roberts, 1995; W. H. Van Schuur and H. A. L. Kiers, 1994). Advances in…

Descriptors: Attitude Measures, Estimation (Mathematics), Item Response Theory, Mathematical Models

DIFferential Testlet Functioning Definitions and Detection. Program Statistics Research Technical Report No. 91-9.

Download full text

Wainer, Howard; And Others – 1991

It is sometimes sensible to think of the fundamental unit of test construction as being larger than an individual item. This unit, dubbed the testlet, must pass muster in the same way that items do. One criterion of a good item is the absence of differential item functioning (DIF). The item must function in the same way as all important…

Descriptors: Definitions, Identification, Item Bias, Item Response Theory

A Comparison of Testlet Reliability for Polytomous Scoring Methods.

Download full text

Crehan, Kevin D.; And Others – 1993

Among the measurement techniques receiving greater attention is the context-dependent item set or testlet. The context-dependent item set consists of a scenario and related test questions. This item format is generally believed to be able to tap higher level thinking. Unfortunately, this item form leads to inter-item dependence within item sets…

Descriptors: Comparative Analysis, Item Response Theory, Measurement Techniques, Reading Tests

Step Fit Analysis with Polytomously Scored Items.

Download full text

Tang, Huixing – 1994

Fit analysis is widely performed in item response theory (IRT) based test development to assess the fit of individual items to the IRT model being used. The paper explores a step fit analysis procedure that is an extension of IRT-based item fit diagnostics applied to the response categories present in popular performance-based tasks. The step fit…

Descriptors: Elementary Secondary Education, Goodness of Fit, Item Response Theory, Models

Extreme Responding Style and the Concreteness-Abstractness Dimension.

Download full text

Swearingen, Dorothy L. – 1998

The problem of response set is important for questionnaire designers and interpreters, but the public is affected as well if policy is determined on the basis of unsupported conclusions. This study focused on one of the most researched response sets, extreme responding (ER), or extreme checking styles, and its relationship to one dimension of…

Descriptors: Abstract Reasoning, Cognitive Style, College Students, Higher Education

Using Response-Time Constraints in Item Selection To Control for Differential Speededness in Computerized Adaptive Testing. Research Report 98-06.

Download full text

van der Linden, Wim J.; Scrams, David J.; Schnipke, Deborah L. – 1998

An item-selection algorithm to neutralize the differential effects of time limits on scores on computerized adaptive tests is proposed. The method is based on a statistical model for the response-time distributions of the examinees on items in the pool that is updated each time a new item has been administered. Predictions from the model are used…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Foreign Countries

« Previous Page | Next Page »

Pages: 1 | ... | 419 | 420 | 421 | 422 | 423 | 424 | 425 | 426 | 427 | ... | 637

Educational and Psychological…	416
Journal of Educational…	367
ProQuest LLC	246
Applied Psychological…	234
Applied Measurement in…	231
ETS Research Report Series	146
Educational Measurement:…	128
Journal of Educational and…	122
Online Submission	115
International Journal of…	105
Grantee Submission	98
Language Testing	93
Psychometrika	93
International Journal of…	80
Journal of Psychoeducational…	72
Educational Assessment	70
Practical Assessment,…	60
Measurement:…	57
Language Assessment Quarterly	55
Journal of Chemical Education	54
Behavioral Research and…	50
Journal of Experimental…	45
Physical Review Physics…	38
Journal of Experimental…	36
International Journal of…	35
More ▼

Journal Articles	5882
Reports - Research	5592
Reports - Evaluative	1556
Speeches/Meeting Papers	1168
Reports - Descriptive	796
Tests/Questionnaires	768
Guides - Classroom - Teacher	472
Guides - Non-Classroom	259
Dissertations/Theses -…	251
Numerical/Quantitative Data	185
Information Analyses	179
Opinion Papers	164
Guides - Classroom - Learner	162
Books	54
Collected Works - General	33
Multilingual/Bilingual…	32
Guides - General	31
Reports - General	21
Book/Product Reviews	20
ERIC Publications	20
Non-Print Media	16
ERIC Digests in Full Text	14
Collected Works - Proceedings	13
Reference Materials - General	13
Collected Works - Serials	12
More ▼

No Child Left Behind Act 2001	36
Individuals with Disabilities…	20
Every Student Succeeds Act…	5
Elementary and Secondary…	4
Race to the Top	4
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
Head Start	3
Americans with Disabilities…	2
Comprehensive Education…	2
Higher Education Act…	2
Immigration Reform and…	2
Civil Rights Act 1964	1
Civil Rights Act 1964 Title…	1
Comprehensive Employment and…	1
Education Consolidation…	1
Education for All Handicapped…	1
Fair Labor Standards Act	1
Higher Education Act Title II	1
Higher Education Opportunity…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Jeanne Clery Disclosure of…	1
Job Training Partnership Act…	1
Kentucky Education Reform Act…	1
More ▼

National Assessment of…	182
Program for International…	179
SAT (College Admission Test)	137
Trends in International…	114
Test of English as a Foreign…	85
Graduate Record Examinations	74
ACT Assessment	44
Advanced Placement…	34
Texas Educational Assessment…	32
Law School Admission Test	30
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	25
Progress in International…	25
Stanford Achievement Tests	24
Raven Progressive Matrices	22
Armed Services Vocational…	20
International English…	20
Peabody Picture Vocabulary…	20
California Achievement Tests	18
Comprehensive Tests of Basic…	18
Test of English for…	17
Metropolitan Achievement Tests	15
General Educational…	14
Graduate Management Admission…	14
Wechsler Adult Intelligence…	13
More ▼