Peer reviewed
Zenisky, April L.; Hambleton, Ronald K.; Robin, Frederic – Educational and Psychological Measurement, 2003
Studied a two-stage methodology for evaluating differential item functioning (DIF) in large-scale assessment data, using a sample of 60,000 students. Findings illustrate the merit of iterative approaches to DIF detection, since items identified at one stage were not necessarily the same as those identified at the…
Descriptors: Item Bias, Large Scale Assessment, Research Methodology, Test Items
Peer reviewed
Gelin, Michaela N.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2003
Investigated potentially biased scale items on the Center for Epidemiological Studies Depression scale (CES-D; Radloff, 1977) in a sample of 600 adults. Overall, results indicate that the scoring method has an effect on differential item functioning (DIF), and that DIF is a property of the item, scoring method, and purpose of the assessment. (SLD)
Descriptors: Depression (Psychology), Item Bias, Scoring, Test Items
Peer reviewed
Gierl, Mark J.; Bolt, Daniel M. – International Journal of Testing, 2001
Presents an overview of nonparametric regression as it applies to differential item functioning analysis, then provides three examples illustrating how nonparametric regression can be applied to multilingual, multicultural data to study group differences. (SLD)
Descriptors: Groups, Item Bias, Nonparametric Statistics, Regression (Statistics)
Peer reviewed
Harmon, Lenore W.; Borgen, Fred H. – Journal of Career Assessment, 1995
Data from over 50,000 people in 50 occupational groups were used to revise the Strong Interest Inventory. New General Reference Samples containing over 18,000 people were used to construct scales, and nearly every scale was revised. (SK)
Descriptors: Evaluation Criteria, Interest Inventories, Measures (Individuals), Occupations
Peer reviewed
Engelhard, George, Jr.; Davis, Melodee; Hansche, Linda – Applied Measurement in Education, 1999
Examined whether reviewers on item-review committees can accurately identify test items that exhibit a variety of flaws. Results with 39 reviewers of a 75-item test show that reviewers exhibit fairly high accuracy rates overall, with statistically significant differences in judgmental accuracy among reviewers. (SLD)
Descriptors: Decision Making, Judges, Review (Reexamination), Test Construction
Peer reviewed
Lee, Guemin; Frisbie, David A. – Applied Measurement in Education, 1999
Studied the appropriateness and implications of using a generalizability theory approach to estimating the reliability of scores from tests composed of testlets. Analyses of data from two national standardization samples suggest that manipulating the number of passages is a more productive way to obtain efficient measurement than manipulating the…
Descriptors: Generalizability Theory, Models, National Surveys, Reliability
Peer reviewed
Nering, Michael L.; Meijer, Rob R. – Applied Psychological Measurement, 1998
Compared the person-response function (PRF) method for identifying examinees who respond to test items in a manner divergent from the underlying test model to the "l(z)" index of Drasgow and others (1985). Although performance of the "l(z)" index was superior in most cases, the PRF was useful in some conditions. (SLD)
Descriptors: Comparative Analysis, Item Response Theory, Models, Responses
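For reference, a minimal sketch of the "l(z)" index mentioned in the entry above, in the standard notation of Drasgow et al. (1985): with response pattern u and item response functions P_i(θ),

$$ l_0 = \sum_i \big[ u_i \ln P_i(\theta) + (1-u_i)\ln(1-P_i(\theta)) \big], \qquad l_z = \frac{l_0 - E(l_0)}{\sqrt{\mathrm{Var}(l_0)}} $$

where $E(l_0)=\sum_i [P_i \ln P_i + (1-P_i)\ln(1-P_i)]$ and $\mathrm{Var}(l_0)=\sum_i P_i(1-P_i)[\ln(P_i/(1-P_i))]^2$. Large negative values of l_z flag response patterns that are improbable under the fitted model.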
Peer reviewed
van der Linden, Wim J.; Adema, Jos J. – Journal of Educational Measurement, 1998
Proposes an algorithm for the assembly of multiple test forms in which the multiple-form problem is reduced to a series of computationally less intensive two-form problems. Illustrates how the method can be implemented using 0-1 linear programming and gives two examples. (SLD)
Descriptors: Algorithms, Linear Programming, Test Construction, Test Format
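A minimal 0-1 linear programming sketch in the spirit of the entry above, using the PuLP library with hypothetical item data; it illustrates the constraint structure of simultaneous form assembly, not the authors' algorithm for reducing the multiple-form problem to two-form problems.

```python
# Two-form assembly as a 0-1 linear program (illustrative sketch).
import random
from pulp import LpProblem, LpMaximize, LpVariable, lpSum

random.seed(0)
n_items, n_forms, form_length = 30, 2, 10
# Hypothetical Fisher information of each item at a target ability point.
info = [random.uniform(0.1, 1.0) for _ in range(n_items)]

prob = LpProblem("parallel_form_assembly", LpMaximize)
x = {(i, f): LpVariable(f"x_{i}_{f}", cat="Binary")
     for i in range(n_items) for f in range(n_forms)}

# Objective: maximize summed information across both forms.
prob += lpSum(info[i] * x[i, f] for i in range(n_items) for f in range(n_forms))

for f in range(n_forms):  # fixed test length per form
    prob += lpSum(x[i, f] for i in range(n_items)) == form_length
for i in range(n_items):  # no item may appear on more than one form
    prob += lpSum(x[i, f] for f in range(n_forms)) <= 1

prob.solve()
for f in range(n_forms):
    print(f"Form {f}:", [i for i in range(n_items) if x[i, f].value() == 1])
```

Each binary variable x[i, f] indicates whether item i is assigned to form f; the overlap constraint keeps the assembled forms disjoint while the shared objective keeps them comparable.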
Peer reviewed
Camilli, Gregory; Congdon, Peter – Journal of Educational and Behavioral Statistics, 1999
Demonstrates a method for studying differential item functioning (DIF) that can be used with dichotomous or polytomous items and that is valid for data that follow a partial credit Item Response Theory model. A simulation study shows that positively biased Type I error rates are in accord with results from previous studies. (SLD)
Descriptors: Estimation (Mathematics), Item Bias, Item Response Theory, Test Items
Peer reviewed
Carlstedt, Berit; Gustafsson, Jan-Eric; Ullstadius, Eva – Intelligence, 2000
Studied whether a change of test item sequencing, intended to increase test complexity, would cause increased involvement of general intelligence using a sample of Swedish military recruits who received heterogeneous (n=1,778) or homogeneous (n=363) tests. Items presented homogeneously showed higher general intelligence ("G") loadings.…
Descriptors: Foreign Countries, Intelligence, Military Personnel, Test Construction
Peer reviewed
Armstrong, Ronald D.; Jones, Douglas H.; Wang, Zhaobo – Journal of Educational and Behavioral Statistics, 1998
Generating a test from an item bank using a criterion based on classical test theory parameters poses considerable problems. A mathematical model is formulated that maximizes the reliability coefficient alpha, subject to logical constraints on the choice of items. Theorems ensuring appropriate application of the Lagrangian relaxation techniques are…
Descriptors: Item Banks, Mathematical Models, Reliability, Test Construction
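For context, the objective being maximized is Cronbach's coefficient alpha, which for a k-item test with item variances σ_i² and total-score variance σ_X² is

$$ \alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma_i^2}{\sigma_X^2}\right) $$

Because σ_X² depends on the covariances among whichever items are selected, the objective is nonlinear in the 0-1 selection variables, which is what motivates relaxation techniques.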
Peer reviewed
Linacre, John Michael – Journal of Outcome Measurement, 1998
Simulation studies indicate that, for responses to complete tests, construction of Rasch measures from observational data, followed by principal components factor analysis of Rasch residuals, provides an effective means of identifying multidimensionality. The most diagnostically useful residual form was found to be the standardized residual. (SLD)
Descriptors: Factor Analysis, Identification, Item Response Theory, Simulation
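A minimal sketch of the diagnostic described in the entry above: standardized residuals from a Rasch model, followed by principal components analysis of those residuals. The data and calibration here are simulated stand-ins, not Linacre's procedure verbatim.

```python
import numpy as np

rng = np.random.default_rng(0)
n_persons, n_items = 500, 20
theta = rng.normal(size=n_persons)   # person measures (assumed pre-calibrated)
b = rng.normal(size=n_items)         # item difficulties
p = 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))   # Rasch P(x=1)
x = (rng.uniform(size=p.shape) < p).astype(float)          # simulated responses

# Standardized residuals: (observed - expected) / sqrt(model variance).
z = (x - p) / np.sqrt(p * (1.0 - p))

# Principal components of the residual correlation matrix.
eigvals = np.linalg.eigvalsh(np.corrcoef(z, rowvar=False))[::-1]
print("Largest residual eigenvalues:", np.round(eigvals[:3], 2))
```

Under a unidimensional model the residual eigenvalues stay near 1; a distinctly larger first eigenvalue suggests a secondary dimension.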
Peer reviewed
Douglas, Jeff; Kim, Hae Rim; Habing, Brian; Gao, Furong – Journal of Educational and Behavioral Statistics, 1998
The local dependence of item pairs is investigated through a conditional covariance function estimation procedure. The conditioning variable used is obtained by a monotonic transformation of total score on the remaining items. Conditional covariance functions are estimated by using kernel smoothing. Several models of local dependence are…
Descriptors: Analysis of Covariance, Estimation (Mathematics), Models, Scores
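A minimal sketch of kernel-smoothed conditional covariance for one item pair, conditioning on the rest score, in the spirit of the entry above; the data, Gaussian kernel, and bandwidth are illustrative choices, not those of the paper.

```python
import numpy as np

rng = np.random.default_rng(1)
x = (rng.uniform(size=(1000, 15)) < 0.6).astype(float)  # hypothetical 0/1 data
i, j, h = 0, 1, 1.0                                     # item pair, bandwidth
rest = x.sum(axis=1) - x[:, i] - x[:, j]                # rest score

def ksmooth(y, s, grid, h):
    """Nadaraya-Watson estimate of E[y | rest = grid] with a Gaussian kernel."""
    w = np.exp(-0.5 * ((grid[:, None] - s[None, :]) / h) ** 2)
    return (w * y[None, :]).sum(axis=1) / w.sum(axis=1)

grid = np.linspace(rest.min(), rest.max(), 25)
# Cov(Xi, Xj | rest) = E[XiXj | rest] - E[Xi | rest] * E[Xj | rest]
ccov = (ksmooth(x[:, i] * x[:, j], rest, grid, h)
        - ksmooth(x[:, i], rest, grid, h) * ksmooth(x[:, j], rest, grid, h))
print(np.round(ccov, 3))
```

Under local independence the estimated conditional covariance hovers near zero across the rest-score range; systematic departures point to locally dependent item pairs.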
Peer reviewed
Smith, Richard M.; Schumacker, Randall E.; Bush, M. Joan – Journal of Outcome Measurement, 1998
Studied the use of item mean squares to evaluate fit to the Rasch model, along with the transformed versions of the item fit statistics. Simulations demonstrate that the critical value of the mean square used to detect misfit is affected by the type of mean square and by the number of persons in the calibration. (SLD)
Descriptors: Goodness of Fit, Item Response Theory, Simulation, Test Items
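For reference, the two item mean squares in common Rasch usage: with observed response x_ni, model expectation P_ni, variance W_ni = P_ni(1 − P_ni), and standardized residual z_ni = (x_ni − P_ni)/√W_ni,

$$ \mathrm{Outfit\,MS}_i = \frac{1}{N}\sum_{n=1}^{N} z_{ni}^2, \qquad \mathrm{Infit\,MS}_i = \frac{\sum_n W_{ni}\, z_{ni}^2}{\sum_n W_{ni}} $$

Both have expectation 1 under model-data fit, which is why, as the entry notes, usable critical values shift with sample size and with the choice of mean square.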
Peer reviewed
Kamata, Akihito – Journal of Educational Measurement, 2001
Presents the hierarchical generalized linear model (HGLM) as an explicit two-level formulation of a multilevel item response model. Shows that the HGLM is equivalent to the Rasch model, and that a characteristic of the HGLM is that person ability can be expressed as a latent regression model with person-characteristic variables. Shows that the…
Descriptors: Item Analysis, Item Response Theory, Regression (Statistics), Test Items
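One common way to write the two-level formulation Kamata describes (notation here is illustrative): at level 1, with item-indicator dummies X_qij for person j,

$$ \log\frac{p_{ij}}{1-p_{ij}} = \beta_{0j} + \sum_{q=1}^{k-1}\beta_{qj}X_{qij} $$

and at level 2, β_0j = γ_00 + u_0j with u_0j ~ N(0, τ), while the β_qj = γ_q0 are fixed. Substituting yields the Rasch form p_ij = 1/(1 + exp(−(θ_j − b_i))), with person ability θ_j = γ_00 + u_0j, which is what allows ability to be modeled as a latent regression on person-characteristic variables.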