Peer reviewed: Engelhard, George, Jr.; Davis, Melodee; Hansche, Linda – Applied Measurement in Education, 1999
Examined whether reviewers on item-review committees can accurately identify test items that exhibit a variety of flaws. Results from 39 reviewers of a 75-item test show fairly high accuracy rates overall, with statistically significant differences in judgmental accuracy among reviewers. (SLD)
Descriptors: Decision Making, Judges, Review (Reexamination), Test Construction
Peer reviewed: Lee, Guemin; Frisbie, David A. – Applied Measurement in Education, 1999
Studied the appropriateness and implications of using a generalizability theory approach to estimating the reliability of scores from tests composed of testlets. Analyses of data from two national standardization samples suggest that manipulating the number of passages is a more productive way to obtain efficient measurement than manipulating the…
Descriptors: Generalizability Theory, Models, National Surveys, Reliability
Peer reviewed: Nering, Michael L.; Meijer, Rob R. – Applied Psychological Measurement, 1998
Compared the person-response function (PRF) method for identifying examinees who respond to test items in a manner divergent from the underlying test model to the "l(z)" index of Drasgow and others (1985). Although performance of the "l(z)" index was superior in most cases, the PRF was useful in some conditions. (SLD)
Descriptors: Comparative Analysis, Item Response Theory, Models, Responses
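The "l(z)" index used as the benchmark above has a closed form: the log-likelihood of an examinee's response pattern, standardized by its model-implied mean and variance. A minimal sketch, where the response patterns and probabilities below are illustrative toy values rather than data from the study:

```python
import math

def lz_index(responses, probs):
    """Standardized log-likelihood person-fit index l(z)
    (Drasgow et al., 1985).

    responses: 0/1 item scores for one examinee
    probs:     model-implied probabilities of a correct
               response on each item for that examinee
    """
    # Observed log-likelihood of the response pattern
    l0 = sum(u * math.log(p) + (1 - u) * math.log(1 - p)
             for u, p in zip(responses, probs))
    # Expected value and variance of l0 under the model
    e = sum(p * math.log(p) + (1 - p) * math.log(1 - p) for p in probs)
    v = sum(p * (1 - p) * math.log(p / (1 - p)) ** 2 for p in probs)
    return (l0 - e) / math.sqrt(v)

# A pattern consistent with the model gives l(z) near or above zero;
# large negative values flag divergent (aberrant) response patterns.
fitting = lz_index([1, 1, 1, 0, 0], [0.9, 0.8, 0.7, 0.3, 0.2])
aberrant = lz_index([0, 0, 0, 1, 1], [0.9, 0.8, 0.7, 0.3, 0.2])
```

The aberrant pattern (missing easy items while passing hard ones) produces a strongly negative index, which is the signal both person-fit methods in the article try to detect.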
Peer reviewed: van der Linden, Wim J.; Adema, Jos J. – Journal of Educational Measurement, 1998
Proposes an algorithm for the assembly of multiple test forms in which the multiple-form problem is reduced to a series of computationally less intensive two-form problems. Illustrates how the method can be implemented using 0-1 linear programming and gives two examples. (SLD)
Descriptors: Algorithms, Linear Programming, Test Construction, Test Format
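The 0-1 formulation above assigns each item a binary decision variable per form. On a hypothetical six-item pool the same idea can be sketched by exhaustive search instead of a linear-programming solver; the item pool, information values, and matching objective below are invented for illustration and are not the paper's model:

```python
from itertools import product

# Hypothetical mini item pool: (item id, difficulty, information at theta = 0)
pool = [("i1", -1.0, 0.30), ("i2", -0.5, 0.45), ("i3", 0.0, 0.55),
        ("i4", 0.2, 0.50), ("i5", 0.6, 0.40), ("i6", 1.1, 0.25)]

FORM_LEN = 3  # each of the two forms gets three items

def assemble_two_forms(pool):
    """Exhaustive 0-1 assignment: each item goes to form 0, form 1, or
    stays unused; keep the assignment that minimizes the gap in total
    information between the two forms (a toy stand-in for the
    linear-programming objective)."""
    best, best_gap = None, float("inf")
    for assign in product((0, 1, None), repeat=len(pool)):
        forms = {0: [], 1: []}
        for item, f in zip(pool, assign):
            if f is not None:
                forms[f].append(item)
        # Enforce the form-length constraint
        if len(forms[0]) != FORM_LEN or len(forms[1]) != FORM_LEN:
            continue
        gap = abs(sum(it[2] for it in forms[0]) -
                  sum(it[2] for it in forms[1]))
        if gap < best_gap:
            best, best_gap = forms, gap
    return best, best_gap

forms, gap = assemble_two_forms(pool)
```

Exhaustive search scales as 3^n and becomes infeasible quickly, which is exactly why the article reduces the multiple-form problem to a series of smaller two-form 0-1 programs.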
Peer reviewed: Camilli, Gregory; Congdon, Peter – Journal of Educational and Behavioral Statistics, 1999
Demonstrates a method for studying differential item functioning (DIF) that can be used with dichotomous or polytomous items and that is valid for data that follow a partial credit Item Response Theory model. A simulation study shows that positively biased Type I error rates are in accord with results from previous studies. (SLD)
Descriptors: Estimation (Mathematics), Item Bias, Item Response Theory, Test Items
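The method above is IRT-based; purely for orientation, the classical Mantel-Haenszel common odds ratio, a different contingency-table DIF statistic for dichotomous items, can be sketched. The strata counts below are invented:

```python
def mantel_haenszel_odds_ratio(strata):
    """Mantel-Haenszel common odds ratio across ability strata.
    Each stratum is a tuple:
    (ref_correct, ref_incorrect, focal_correct, focal_incorrect)."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return num / den

# Balanced strata: both groups have the same odds of success -> ratio of 1
no_dif = mantel_haenszel_odds_ratio([(30, 10, 15, 5), (20, 20, 10, 10)])
# Reference group succeeds more often at the same ability level -> ratio > 1
dif = mantel_haenszel_odds_ratio([(30, 10, 10, 10), (20, 20, 5, 15)])
```

A ratio far from 1 flags an item that behaves differently for matched examinees from the two groups, the same substantive question the article's IRT-based index addresses for partial-credit data.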
Peer reviewed: Carlstedt, Berit; Gustafsson, Jan-Eric; Ullstadius, Eva – Intelligence, 2000
Studied whether a change of test item sequencing, intended to increase test complexity, would cause increased involvement of general intelligence using a sample of Swedish military recruits who received heterogeneous (n=1,778) or homogeneous (n=363) tests. Items presented homogeneously showed higher general intelligence ("G") loadings.…
Descriptors: Foreign Countries, Intelligence, Military Personnel, Test Construction
Peer reviewed: Armstrong, Ronald D.; Jones, Douglas H.; Wang, Zhaobo – Journal of Educational and Behavioral Statistics, 1998
Generating a test from an item bank using a criterion based on classical test theory parameters poses considerable problems. A mathematical model is formulated that maximizes the reliability coefficient alpha, subject to logical constraints on the choice of items. Theorems ensuring appropriate application of the Lagrangian relaxation techniques are…
Descriptors: Item Banks, Mathematical Models, Reliability, Test Construction
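The objective the model maximizes, coefficient alpha, is itself straightforward to compute: the number of items scales one minus the ratio of summed item variances to total-score variance. A minimal sketch on a hypothetical 4-person, 3-item score matrix (this does not reproduce the paper's optimization machinery):

```python
def cronbach_alpha(scores):
    """Coefficient alpha from a persons-by-items score matrix."""
    k = len(scores[0])  # number of items

    def var(xs):  # population variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / len(xs)

    item_vars = [var([row[j] for row in scores]) for j in range(k)]
    total_var = var([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical 0/1 scores: 4 persons x 3 items, roughly Guttman-ordered
scores = [[1, 1, 1],
          [1, 1, 0],
          [0, 1, 0],
          [0, 0, 0]]
alpha = cronbach_alpha(scores)  # 0.75 for this toy matrix
```

Selecting the item subset that maximizes this quantity under length and content constraints is a combinatorial problem, which motivates the Lagrangian treatment in the article.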
Peer reviewed: Linacre, John Michael – Journal of Outcome Measurement, 1998
Simulation studies indicate that, for responses to complete tests, construction of Rasch measures from observational data, followed by principal components factor analysis of Rasch residuals, provides an effective means of identifying multidimensionality. The most diagnostically useful residual form was found to be the standardized residual. (SLD)
Descriptors: Factor Analysis, Identification, Item Response Theory, Simulation
Peer reviewed: Douglas, Jeff; Kim, Hae Rim; Habing, Brian; Gao, Furong – Journal of Educational and Behavioral Statistics, 1998
The local dependence of item pairs is investigated through a conditional covariance function estimation procedure. The conditioning variable used is obtained by a monotonic transformation of total score on the remaining items. Conditional covariance functions are estimated by using kernel smoothing. Several models of local dependence are…
Descriptors: Analysis of Covariance, Estimation (Mathematics), Models, Scores
Peer reviewed: Smith, Richard M.; Schumacker, Randall E.; Bush, M. Joan – Journal of Outcome Measurement, 1998
Using item mean squares to evaluate fit to the Rasch model was studied, also considering the transformed version of the item fit statistics. Simulations demonstrate that the critical value for the mean square used to detect misfit is affected by the type of mean square and the number of persons in the calibration. (SLD)
Descriptors: Goodness of Fit, Item Response Theory, Simulation, Test Items
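The mean-square fit statistics under study have a standard form: squared standardized residuals from the Rasch model, averaged either unweighted (outfit) or weighted by response variance (infit). A sketch for a single examinee, with hypothetical ability and item difficulties:

```python
import math

def rasch_fit_mean_squares(responses, theta, difficulties):
    """Unweighted (outfit) and information-weighted (infit) mean
    squares for one examinee under the dichotomous Rasch model."""
    # Model-implied probability of success on each item
    p = [1 / (1 + math.exp(-(theta - b))) for b in difficulties]
    r2 = [(u - pi) ** 2 for u, pi in zip(responses, p)]   # squared residuals
    w = [pi * (1 - pi) for pi in p]                       # response variances
    outfit = sum(ri / wi for ri, wi in zip(r2, w)) / len(p)
    infit = sum(r2) / sum(w)
    return outfit, infit

# A Guttman-consistent pattern (easy item right, hard item wrong)
consistent = rasch_fit_mean_squares([1, 1, 0], 0.0, [-1.0, 0.0, 1.0])
# A reversed, misfitting pattern
aberrant = rasch_fit_mean_squares([0, 0, 1], 0.0, [-1.0, 0.0, 1.0])
```

Values near 1 indicate fit; the simulation finding above, that the critical value depends on the type of mean square and the calibration sample size, is about how far from 1 these statistics drift by chance.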
Peer reviewed: Kamata, Akihito – Journal of Educational Measurement, 2001
Presents the hierarchical generalized linear model (HGLM) as an explicit two-level formulation of a multilevel item response model. Shows that the HGLM is equivalent to the Rasch model, and that a characteristic of the HGLM is that person ability can be expressed as a latent regression model with person-characteristic variables. Shows that the…
Descriptors: Item Analysis, Item Response Theory, Regression (Statistics), Test Items
Peer reviewed: Bolt, Daniel M. – Journal of Educational Measurement, 2000
Reviewed aspects of the SIBTEST procedure through three studies. Study 1 examined the effects of item format using 40 mathematics items from the Scholastic Assessment Test. Study 2 considered the effects of a problem type factor and its interaction with item format for eight items, and study 3 evaluated the degree to which factors varied in the…
Descriptors: Computer Software, Hypothesis Testing, Item Bias, Mathematics
Peer reviewed: Gierl, Mark J.; Leighton, Jacqueline P.; Hunka, Stephen M. – Educational Measurement: Issues and Practice, 2000
Discusses the logic of the rule-space model (K. Tatsuoka, 1983) as it applies to test development and analysis. The rule-space model is a statistical method for classifying examinees' test item responses into a set of attribute-mastery patterns associated with different cognitive skills. Directs readers to a tutorial that may be downloaded. (SLD)
Descriptors: Item Analysis, Item Response Theory, Test Construction, Test Items
Peer reviewed: Jackson, Stacy L.; And Others – Journal of Career Assessment, 1996
Factor analysis of 1,030 adults' responses on the Myers Briggs Type Indicator (MBTI) was used to test 4 alternative models. Results support a four-factor structure similar to the original Jungian structure. Elimination of 12 MBTI items was recommended. (SK)
Descriptors: Construct Validity, Factor Analysis, Models, Personality Measures
Peer reviewed: Wainer, Howard – Journal of Educational and Behavioral Statistics, 2000
Suggests that because of the nonlinear relationship between item usage and item security, the problems of test security posed by continuous administration of standardized tests cannot be resolved merely by increasing the size of the item pool. Offers alternative strategies to overcome these problems, distributing test items so as to avoid the…
Descriptors: Computer Assisted Testing, Standardized Tests, Test Items, Testing Problems