Showing 6,256 to 6,270 of 9,552 results
Peer reviewed
Sciarone, A. G.; Schoorl, J. J. – Language Learning, 1989
Presents findings from an experiment that sought to determine the minimal number of blanks required to ensure parallelism between cloze tests that differ only in the point at which deletion starts. Results showed the required minimum depended on the scoring method used, with exact-word tests requiring about 100 blanks and acceptable-word tests…
Descriptors: Cloze Procedure, Dutch, Indonesian, Reading Tests
Peer reviewed
Liou, Michelle – Applied Psychological Measurement, 1988
In applying I. I. Bejar's method for detecting the dimensionality of achievement tests, researchers should be cautious in interpreting the slope of the principal axis. Other information from the data is needed in conjunction with Bejar's method when addressing item dimensionality. (SLD)
Descriptors: Achievement Tests, Computer Simulation, Difficulty Level, Equated Scores
Peer reviewed
Baker, Frank B. – Applied Psychological Measurement, 1988
The form of the item log-likelihood surface was investigated under two-parameter and three-parameter logistic models. Results confirm that the LOGIST program procedures used to locate the maximum of the likelihood functions are consistent with the form of the item log-likelihood surface. (SLD)
Descriptors: Estimation (Mathematics), Factor Analysis, Graphs, Latent Trait Theory
Peer reviewed
Wilcox, Rand R.; And Others – Journal of Educational Measurement, 1988
The second-response conditional probability model of decision-making strategies used by examinees answering multiple-choice test items was revised. Increasing the number of distractors or giving examinees (N=106) the option to follow the model improved results and yielded a good fit to the data for 29 of 30 items. (SLD)
Descriptors: Cognitive Tests, Decision Making, Mathematical Models, Multiple Choice Tests
Peer reviewed
Lin, Miao-Hsiang; Hsiung, Chao A. – Psychometrika, 1994
Two simple empirical approximate Bayes estimators are introduced for estimating domain scores under the binomial and hypergeometric distributions, respectively. Criteria are established regarding use of these functions over their maximum likelihood estimation counterparts. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computation, Equations (Mathematics)
Peer reviewed
Hancock, Gregory R.; And Others – Educational and Psychological Measurement, 1993
Two-option multiple-choice vocabulary test items are compared with comparably written true-false test items. Results from a study with 111 high school students suggest that multiple-choice items provide a significantly more reliable measure than the true-false format. (SLD)
Descriptors: Ability, High School Students, High Schools, Objective Tests
Peer reviewed
Hamp-Lyons, Liz; Mathias, Sheila Prochnow – Journal of Second Language Writing, 1994
Expert judgments of prompt difficulty in essay tests were examined to discover whether they could be used at the item-writing stage of test development. Findings show that "expert judges" agree considerably about prompt difficulty and prompt task type, but they cannot predict which prompts will result in high or low scores for…
Descriptors: Cues, English (Second Language), Essay Tests, Language Tests
Peer reviewed
Millman, Jason – Educational Measurement: Issues and Practice, 1994
The unfulfilled promise of criterion-referenced measurement is that it would permit valid inferences about what a student could and could not do. To come closest to achieving all that criterion-referenced testing originally promised, tests of higher item density, with more items per unit of the content domain, are required. (SLD)
Descriptors: Criterion Referenced Tests, Educational History, Inferences, Norm Referenced Tests
Peer reviewed
Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994
The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)
Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics
Peer reviewed
Otter, Martha E.; And Others – Journal of Educational Measurement, 1995
The ability of two components, interpretation of a question and memory, to forecast the test-retest correlation coefficients of reading test items was studied with initial samples of 916 elementary and 949 secondary school students. For both populations, both components forecast the relative sizes of the test-retest correlation coefficients. (SLD)
Descriptors: Cognitive Processes, Comprehension, Correlation, Elementary School Students
Peer reviewed
Hetter, Rebecca D.; And Others – Applied Psychological Measurement, 1994
Effects on computerized adaptive test scores of using a paper-and-pencil (P&P) calibration to select items and estimate scores were compared with effects of using a computer calibration. Results with 2,999 Navy recruits support the use of item parameters calibrated from either P&P or computer administrations. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Estimation (Mathematics)
Peer reviewed
Van der Ven, Ad H. G. S. – Educational and Psychological Measurement, 1992
The dichotomous Rasch model was applied to verbal subtest scores on the Intelligence Structure Test Battery for 905 12- to 15-year-old secondary school students in the Netherlands. Results suggest that, if any factor is used to increase difficulty of items, that factor should be used on all items. (SLD)
Descriptors: Difficulty Level, Foreign Countries, Intelligence Tests, Secondary Education
Vance, Booney; Sabatino, David – Diagnostique, 1991
The issues of construct validity, predictive validity, and item content bias on the Wechsler Intelligence Scale for Children-Revised (WISC-R) are examined. The review concludes that most objective data have not supported claims of bias in the WISC-R when it is used with children of different ethnic backgrounds. (JDD)
Descriptors: Construct Validity, Content Validity, Elementary Secondary Education, Ethnic Groups
Peer reviewed
Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1992
A new method for evaluating the content representation of a test is illustrated. Item similarity ratings were obtained from three content domain experts to assess whether ratings corresponded to item groupings specified in the test blueprint. Multidimensional scaling and cluster analysis provided substantial information about the test's content…
Descriptors: Cluster Analysis, Content Analysis, Multidimensional Scaling, Multiple Choice Tests
Peer reviewed
Weir, C. J.; And Others – Reading in a Foreign Language, 1990
Presents a critical analysis of an earlier article and argues that, although the validity of the High/Low distinction is questionable, it is possible for practical testing purposes to obtain reliable judgments from properly selected and trained judges. (seven references) (GLR)
Descriptors: Evaluation Methods, Reading Comprehension, Reading Tests, Second Language Learning