Peer reviewed
French, Ann W.; Miller, Timothy R. – Journal of Educational Measurement, 1996
A computer simulation study was conducted to determine the feasibility of using logistic regression procedures to detect differential item functioning (DIF) in polytomous items. Results indicate that logistic regression is powerful in detecting most forms of DIF, although it requires large amounts of data manipulation and careful interpretation.…
Descriptors: Computer Simulation, Identification, Item Bias, Test Interpretation
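For readers unfamiliar with the technique, a minimal sketch of logistic-regression DIF screening follows, for the simpler dichotomous case (French and Miller treat polytomous items); data and variable names are made up, and this is not the authors' code:

```python
# Minimal sketch of logistic-regression DIF screening for one dichotomous item.
# Hypothetical data and names throughout; not the authors' implementation.
import numpy as np
import statsmodels.api as sm
from scipy.stats import chi2

rng = np.random.default_rng(0)
n = 1000
group = rng.integers(0, 2, n)              # 0 = reference, 1 = focal group
theta = rng.normal(0, 1, n)                # stand-in for ability
total = theta + rng.normal(0, 0.3, n)      # observed matching score
# Simulate uniform DIF: the item is harder for the focal group.
p = 1 / (1 + np.exp(-(theta - 0.5 * group)))
item = rng.binomial(1, p)

def fit(X):
    return sm.Logit(item, sm.add_constant(X)).fit(disp=0)

m0 = fit(np.column_stack([total]))                        # matching score only
m1 = fit(np.column_stack([total, group]))                 # + group: uniform DIF
m2 = fit(np.column_stack([total, group, total * group]))  # + interaction: nonuniform DIF

# Likelihood-ratio chi-square tests between the nested models.
print("uniform DIF    p =", chi2.sf(2 * (m1.llf - m0.llf), df=1))
print("nonuniform DIF p =", chi2.sf(2 * (m2.llf - m1.llf), df=1))
```

The nested-model comparison is what lets the procedure separate uniform from nonuniform DIF, at the cost of the data manipulation the abstract mentions.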
Peer reviewed
Pelton, Timothy W.; Bunderson, C. Victor – Journal of Applied Measurement, 2003
Attempted to illuminate practical limitations of the Rasch model by focusing on the recovery of the density scale through five simulation trials. Results show that when error distributions are insufficient, the results may be ordinal at best, and when error distributions are nonsymmetrical, the positions of items may be biased with respect to the…
Descriptors: Error of Measurement, Item Response Theory, Simulation, Test Items
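A minimal sketch of the kind of simulation trial described, assuming a standard dichotomous Rasch setup rather than the authors' specific design:

```python
# Minimal sketch: generate Rasch responses, then check how well item positions
# are recovered. Hypothetical setup; not the authors' simulation design.
import numpy as np

rng = np.random.default_rng(1)
n_persons, n_items = 500, 20
theta = rng.normal(0, 1, n_persons)          # person locations
b = np.linspace(-2, 2, n_items)              # true item locations
# Rasch model: P(X=1) = exp(theta - b) / (1 + exp(theta - b))
p = 1 / (1 + np.exp(-(theta[:, None] - b[None, :])))
X = rng.binomial(1, p)

# Crude recovery check: logits of item p-values track -b up to scale and shift.
pvals = X.mean(axis=0)
b_hat = -np.log(pvals / (1 - pvals))
print(np.corrcoef(b, b_hat)[0, 1])           # near 1 when the model holds
```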
Peer reviewed
Scheiblechner, Hartmann – Psychometrika, 2003
Presented nonparametric tests of the validity of unidimensional ordinal probabilistic polytomous item response theory models, along with procedures for testing the comonotonicity of two item sets and for item selection. Described advantages of the new approach. (SLD)
Descriptors: Item Response Theory, Nonparametric Statistics, Selection, Test Items
Peer reviewed
Bolt, Daniel – Psychometrika, 2003
Any item response theory (IRT) researcher or practitioner will find something of interest in this book, which covers a broad range of topics in essays by well-known researchers. Chapters are organized into sections devoted to parametric and nonparametric IRT topics. (SLD)
Descriptors: Item Response Theory, Measurement Techniques, Test Construction, Test Items
Peer reviewed
Stanton, Jeffrey M.; Bachiochi, Peter D.; Robie, Chet; Perez, Lisa M.; Smith, Patricia C. – Educational and Psychological Measurement, 2002
Studied the Work Satisfaction subscale of the Job Descriptive Index (JDI) to determine the difference between measuring work stress and measuring work satisfaction. Results from samples of 1,623 and 314 adults provide evidence supporting the removal of some contaminating items from the JDI. (SLD)
Descriptors: Adults, Measures (Individuals), Stress Variables, Test Construction
Peer reviewed
Davey, Tim; And Others – Applied Psychological Measurement, 1996
Scales defined by most item response theory (IRT) models are truly invariant with respect to certain linear transformations of parameters. The problem is to find the proper transformation to place calibrations on a common scale. This paper explores issues of extending and adapting unidimensional linking procedures to multidimensional IRT models.…
Descriptors: Equated Scores, Item Response Theory, Models, Scaling
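A minimal sketch of the unidimensional baseline the article extends, mean/sigma linking, which estimates the linear transformation from common-item difficulty estimates (the values below are made up):

```python
# Minimal sketch of mean/sigma linking between two IRT calibrations.
# b_old and b_new are hypothetical difficulty estimates for common items.
import numpy as np

b_old = np.array([-1.2, -0.4, 0.1, 0.8, 1.5])   # target-scale calibrations
b_new = np.array([-1.0, -0.2, 0.3, 1.0, 1.7])   # scale to be transformed

# Find the linear transformation theta* = A*theta + B aligning the scales.
A = b_old.std() / b_new.std()
B = b_old.mean() - A * b_new.mean()

b_linked = A * b_new + B          # difficulties on the common scale
# For a 2PL, discriminations transform as a_linked = a_new / A.
print(A, B, b_linked)
```

The multidimensional case the authors address is harder precisely because the invariance is over a richer class of linear transformations than this single (A, B) pair.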
Peer reviewed
Zenisky, April L.; Hambleton, Ronald K.; Robin, Frederic – Educational and Psychological Measurement, 2003
Studied a two-stage methodology for evaluating differential item functioning (DIF) in large-scale assessment data, using a sample of 60,000 students. Findings illustrate the merit of iterative approaches for DIF detection, since items identified at one stage were not necessarily the same as those identified at the…
Descriptors: Item Bias, Large Scale Assessment, Research Methodology, Test Items
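A minimal sketch of the iterative logic the findings support, purification of the matching score, assuming a hypothetical per-item dif_test function that returns a p-value:

```python
# Minimal sketch of iterative DIF screening ("purification"): re-screen after
# removing flagged items from the matching score. dif_test is hypothetical.
import numpy as np

def purify(responses, dif_test, alpha=0.05, max_iter=10):
    """responses: (n_examinees, n_items) 0/1 matrix."""
    n_items = responses.shape[1]
    flagged = np.zeros(n_items, dtype=bool)
    for _ in range(max_iter):
        # Match on the total score over currently clean items only.
        matching = responses[:, ~flagged].sum(axis=1)
        new_flags = np.array([dif_test(responses[:, j], matching) < alpha
                              for j in range(n_items)])
        if np.array_equal(new_flags, flagged):
            break                  # flags have stabilized; stop iterating
        flagged = new_flags
    return flagged
```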
Peer reviewed
Gelin, Michaela N.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2003
Investigated potentially biased scale items on the Center for Epidemiological Studies Depression scale (CES-D; Radloff, 1977) in a sample of 600 adults. Overall, results indicate that the scoring method has an effect on differential item functioning (DIF), and that DIF is a property of the item, scoring method, and purpose of the assessment. (SLD)
Descriptors: Depression (Psychology), Item Bias, Scoring, Test Items
Peer reviewed
Gierl, Mark J.; Bolt, Daniel M. – International Journal of Testing, 2001
Presents an overview of nonparametric regression as it applies to differential item functioning analysis and then provides three examples to illustrate how nonparametric regression can be applied to multilingual, multicultural data to study group differences. (SLD)
Descriptors: Groups, Item Bias, Nonparametric Statistics, Regression (Statistics)
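A minimal sketch of the general idea, smoothing each group's item scores against the matching score and comparing the curves, here with a Nadaraya-Watson kernel smoother on simulated data (the authors' estimator and data differ):

```python
# Minimal sketch of nonparametric-regression DIF analysis. Hypothetical data.
import numpy as np

def kernel_smooth(x, y, grid, bandwidth=0.5):
    # Nadaraya-Watson estimate of E[y | x] at each grid point.
    w = np.exp(-0.5 * ((grid[:, None] - x[None, :]) / bandwidth) ** 2)
    return (w * y[None, :]).sum(axis=1) / w.sum(axis=1)

rng = np.random.default_rng(2)
score = rng.normal(0, 1, 800)                    # matching variable
group = rng.integers(0, 2, 800)
p = 1 / (1 + np.exp(-(score - 0.4 * group)))     # item harder for group 1
item = rng.binomial(1, p)

grid = np.linspace(-2, 2, 41)
curve_ref = kernel_smooth(score[group == 0], item[group == 0], grid)
curve_foc = kernel_smooth(score[group == 1], item[group == 1], grid)
# Area between the curves is one simple DIF effect-size summary.
print(np.trapz(np.abs(curve_ref - curve_foc), grid))
```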
Peer reviewed
Harmon, Lenore W.; Borgen, Fred H. – Journal of Career Assessment, 1995
Data from over 50,000 people in 50 occupational groups were used to revise the Strong Interest Inventory. New General Reference Samples containing over 18,000 people were used to construct scales, and nearly every scale was revised. (SK)
Descriptors: Evaluation Criteria, Interest Inventories, Measures (Individuals), Occupations
Peer reviewed
Engelhard, George, Jr.; Davis, Melodee; Hansche, Linda – Applied Measurement in Education, 1999
Examined whether reviewers on item-review committees can accurately identify test items that exhibit a variety of flaws. Results with 39 reviewers of a 75-item test show that reviewers exhibit fairly high accuracy rates overall, with statistically significant differences in judgmental accuracy among reviewers. (SLD)
Descriptors: Decision Making, Judges, Review (Reexamination), Test Construction
Peer reviewed
Lee, Guemin; Frisbie, David A. – Applied Measurement in Education, 1999
Studied the appropriateness and implications of using a generalizability theory approach to estimating the reliability of scores from tests composed of testlets. Analyses of data from two national standardization samples suggest that manipulating the number of passages is a more productive way to obtain efficient measurement than manipulating the…
Descriptors: Generalizability Theory, Models, National Surveys, Reliability
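A minimal sketch of the decision-study arithmetic behind this result, with made-up variance components for a persons x (items within passages) design:

```python
# Minimal sketch of a G-theory decision study for a testlet-based test:
# vary passages vs. items per passage and compare generalizability
# coefficients. Variance component values below are made up for illustration.
var_p   = 0.30   # persons
var_pp  = 0.08   # persons x passages
var_pip = 0.20   # persons x items-within-passages (plus residual)

def g_coefficient(n_passages, n_items_per_passage):
    error = var_pp / n_passages + var_pip / (n_passages * n_items_per_passage)
    return var_p / (var_p + error)

# Same total length (32 items), different allocations:
print(g_coefficient(n_passages=8, n_items_per_passage=4))
print(g_coefficient(n_passages=4, n_items_per_passage=8))
```

With components like these, adding passages shrinks both error terms while adding items per passage shrinks only one, which is the pattern behind the abstract's conclusion.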
Peer reviewed
Nering, Michael L.; Meijer, Rob R. – Applied Psychological Measurement, 1998
Compared the person-response function (PRF) method for identifying examinees who respond to test items in a manner divergent from the underlying test model to the "l(z)" index of Drasgow and others (1985). Although performance of the "l(z)" index was superior in most cases, the PRF was useful in some conditions. (SLD)
Descriptors: Comparative Analysis, Item Response Theory, Models, Responses
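For context, a minimal sketch of the "l(z)" index of Drasgow et al. (1985) that the PRF method was compared against, applied to a hypothetical response vector:

```python
# Minimal sketch of the standardized log-likelihood person-fit index l_z.
# p holds model-implied correct-response probabilities for one examinee.
import numpy as np

def l_z(u, p):
    """u: 0/1 response vector; p: model probabilities P(X=1) per item."""
    q = 1 - p
    l0 = np.sum(u * np.log(p) + (1 - u) * np.log(q))    # observed log-likelihood
    mean = np.sum(p * np.log(p) + q * np.log(q))        # expectation under model
    var = np.sum(p * q * np.log(p / q) ** 2)            # variance under model
    return (l0 - mean) / np.sqrt(var)

u = np.array([1, 1, 0, 1, 0, 0, 1, 1])
p = np.array([0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2])
print(l_z(u, p))   # large negative values suggest a misfitting response pattern
```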
Peer reviewed
van der Linden, Wim J.; Adema, Jos J. – Journal of Educational Measurement, 1998
Proposes an algorithm for the assembly of multiple test forms in which the multiple-form problem is reduced to a series of computationally less intensive two-form problems. Illustrates how the method can be implemented using 0-1 linear programming and gives two examples. (SLD)
Descriptors: Algorithms, Linear Programming, Test Construction, Test Format
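A minimal sketch of 0-1 linear-programming assembly for a single form, using the PuLP solver and made-up item data; the article's contribution, decomposing the multiple-form problem into two-form problems, sits on top of problems of this shape:

```python
# Minimal sketch of 0-1 LP test assembly for one form. Item data are made up.
import numpy as np
from pulp import LpProblem, LpMaximize, LpVariable, lpSum, LpBinary

rng = np.random.default_rng(3)
n_items, form_length = 50, 10
info = rng.uniform(0.2, 1.0, n_items)   # item information at a target theta

prob = LpProblem("one_form_assembly", LpMaximize)
x = [LpVariable(f"x{i}", cat=LpBinary) for i in range(n_items)]

prob += lpSum(info[i] * x[i] for i in range(n_items))   # maximize information
prob += lpSum(x) == form_length                         # fixed test length
prob.solve()
selected = [i for i in range(n_items) if x[i].value() == 1]
print(selected)
```

Real assembly problems add content, exposure, and overlap constraints as further linear inequalities on the same 0-1 variables.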
Peer reviewed
Camilli, Gregory; Congdon, Peter – Journal of Educational and Behavioral Statistics, 1999
Demonstrates a method for studying differential item functioning (DIF) that can be used with dichotomous or polytomous items and that is valid for data that follow a partial credit Item Response Theory model. A simulation study shows that positively biased Type I error rates are in accord with results from previous studies. (SLD)
Descriptors: Estimation (Mathematics), Item Bias, Item Response Theory, Test Items