Lin, Miao-Hsiang; Hsiung, Chao A. – Psychometrika, 1994 (peer reviewed)
Two simple empirical approximate Bayes estimators are introduced for estimating domain scores under binomial and hypergeometric distributions, respectively. Criteria are established for preferring these estimators over their maximum likelihood counterparts. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computation, Equations (Mathematics)
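The abstract above does not give the estimators themselves, but the general idea of empirical Bayes domain-score estimation under a binomial model can be sketched as follows. This is a minimal illustration, not the authors' method: it assumes a beta-binomial setup with the prior fit by the method of moments, and all names are illustrative.

```python
import numpy as np

def eb_domain_scores(x, n):
    """Empirical Bayes domain-score estimates under a binomial model.

    x : array of correct-response counts, one per examinee
    n : common test length (number of items)

    A Beta(a, b) prior is fit to the group of observed proportions by
    the method of moments; each examinee's maximum likelihood estimate
    x/n is then shrunk toward the group mean via the posterior mean.
    """
    p = x / n                                  # MLE proportions
    mu, var = p.mean(), p.var()
    # method-of-moments prior strength; guard against zero variance
    strength = max(mu * (1 - mu) / max(var, 1e-9) - 1, 0.0)
    a, b = mu * strength, (1 - mu) * strength
    return (x + a) / (n + a + b)               # posterior mean scores

scores = eb_domain_scores(np.array([3, 7, 9, 5]), n=10)
```

The shrinkage is strongest when the group's proportions cluster tightly (small variance implies a strong prior), which is the usual empirical Bayes trade-off against the MLE.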
Hancock, Gregory R.; And Others – Educational and Psychological Measurement, 1993 (peer reviewed)
Two-option multiple-choice vocabulary test items are compared with comparably written true-false test items. Results from a study with 111 high school students suggest that multiple-choice items provide a significantly more reliable measure than the true-false format. (SLD)
Descriptors: Ability, High School Students, High Schools, Objective Tests
Hamp-Lyons, Liz; Mathias, Sheila Prochnow – Journal of Second Language Writing, 1994 (peer reviewed)
Expert judgments of prompt difficulty in essay tests were examined to discover whether they could be used at the item-writing stage of test development. Findings show that "expert judges" share considerable agreement about prompt difficulty and prompt task type, but they cannot predict which prompts will result in high or low scores for…
Descriptors: Cues, English (Second Language), Essay Tests, Language Tests
Millman, Jason – Educational Measurement: Issues and Practice, 1994 (peer reviewed)
The unfulfilled promise of criterion-referenced measurement is that it would permit valid inferences about what a student could and could not do. To come closest to achieving all that criterion-referenced testing originally promised, tests of higher item density, with more items per unit of the domain, are required. (SLD)
Descriptors: Criterion Referenced Tests, Educational History, Inferences, Norm Referenced Tests
Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994 (peer reviewed)
The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)
Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics
Otter, Martha E.; And Others – Journal of Educational Measurement, 1995 (peer reviewed)
The ability of two components, interpretation of a question and memory, to forecast the test-retest correlation coefficients of reading test items was studied with initial samples of 916 elementary and 949 secondary school students. For both populations, both components forecast the relative sizes of the test-retest correlation coefficients. (SLD)
Descriptors: Cognitive Processes, Comprehension, Correlation, Elementary School Students
Hetter, Rebecca D.; And Others – Applied Psychological Measurement, 1994 (peer reviewed)
Effects on computerized adaptive test scores of using a paper-and-pencil (P&P) calibration to select items and estimate scores were compared with the effects of using a computer calibration. Results with 2,999 Navy recruits support the use of item parameters calibrated from either P&P or computer administrations. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Estimation (Mathematics)
Van der Ven, Ad H. G. S. – Educational and Psychological Measurement, 1992 (peer reviewed)
The dichotomous Rasch model was applied to verbal subtest scores on the Intelligence Structure Test Battery for 905 12- to 15-year-old secondary school students in the Netherlands. Results suggest that, if any factor is used to increase the difficulty of items, that factor should be used on all items. (SLD)
Descriptors: Difficulty Level, Foreign Countries, Intelligence Tests, Secondary Education
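For readers unfamiliar with the dichotomous Rasch model referenced above, its item response function has a standard closed form; the sketch below (NumPy; parameter values are illustrative, not from the study) shows how ability and item difficulty combine on the logit scale.

```python
import numpy as np

def rasch_prob(theta, b):
    """Probability of a correct response under the dichotomous Rasch model:

        P(X = 1 | theta, b) = exp(theta - b) / (1 + exp(theta - b))

    where theta is person ability and b is item difficulty, both in logits.
    """
    return 1.0 / (1.0 + np.exp(-(theta - b)))

# A harder item (larger b) yields a lower probability at the same ability.
p_easy = rasch_prob(theta=0.5, b=-1.0)   # ≈ 0.82
p_hard = rasch_prob(theta=0.5, b=1.0)    # ≈ 0.38
```

Because only the difference theta − b enters the model, any manipulation that raises difficulty shifts b uniformly; applying such a factor to only some items changes their relative difficulties and can break the model's fit, which is consistent with the abstract's conclusion.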
Vance, Booney; Sabatino, David – Diagnostique, 1991
The issues of construct validity, predictive validity, and item content bias on the Wechsler Intelligence Scale for Children-Revised (WISC-R) are examined. The review concludes that most objective data do not support claims of bias in the WISC-R when it is used with children of different ethnic backgrounds. (JDD)
Descriptors: Construct Validity, Content Validity, Elementary Secondary Education, Ethnic Groups
Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1992 (peer reviewed)
A new method for evaluating the content representation of a test is illustrated. Item similarity ratings were obtained from three content domain experts to assess whether ratings corresponded to item groupings specified in the test blueprint. Multidimensional scaling and cluster analysis provided substantial information about the test's content…
Descriptors: Cluster Analysis, Content Analysis, Multidimensional Scaling, Multiple Choice Tests
Weir, C. J.; And Others – Reading in a Foreign Language, 1990 (peer reviewed)
Presents a critical analysis of an earlier article and argues that, although the validity of the High/Low distinction is questionable, it is possible for practical testing purposes to obtain reliable judgments from properly selected and trained judges. (seven references) (GLR)
Descriptors: Evaluation Methods, Reading Comprehension, Reading Tests, Second Language Learning
Ackerman, Terry A. – Applied Measurement in Education, 1994 (peer reviewed)
When item response data do not satisfy the unidimensionality assumption, multidimensional item response theory (MIRT) should be used to model the item-examinee interaction. This article presents and discusses MIRT analyses designed to give better insight into what individual items are measuring. (SLD)
Descriptors: Evaluation Methods, Item Response Theory, Measurement Techniques, Models
Roznowski, Mary; And Others – Applied Psychological Measurement, 1991 (peer reviewed)
Three heuristic methods of assessing the dimensionality of binary item pools were evaluated in a Monte Carlo investigation. The indices were based on (1) the local independence of unidimensional tests; (2) patterns of second-factor loadings derived from simplex theory; and (3) the shape of the curve of successive eigenvalues. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Correlation, Evaluation Methods
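The third heuristic above, inspecting the curve of successive eigenvalues, can be illustrated with a short sketch. This is not the authors' procedure: it uses ordinary phi (Pearson) correlations for simplicity, where tetrachoric correlations are the more common choice for binary items, and the simulated data and names are illustrative.

```python
import numpy as np

def eigenvalue_curve(responses):
    """Successive eigenvalues of the inter-item correlation matrix of a
    binary item pool, sorted descending. A dominant first eigenvalue
    followed by a sharp drop suggests essential unidimensionality."""
    r = np.corrcoef(responses, rowvar=False)     # items in columns
    return np.sort(np.linalg.eigvalsh(r))[::-1]

# Simulated unidimensional data: one latent trait drives all five items.
rng = np.random.default_rng(0)
theta = rng.normal(size=(500, 1))                # person abilities
b = rng.normal(size=5)                           # item difficulties
probs = 1 / (1 + np.exp(-(theta - b)))           # Rasch-type response model
data = (rng.random((500, 5)) < probs).astype(int)

eigs = eigenvalue_curve(data)  # first eigenvalue dominates the rest
```

With truly unidimensional data the curve drops steeply after the first eigenvalue; multidimensional pools show several eigenvalues well above the rest before the curve flattens.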
Wainer, Howard; And Others – Journal of Educational Measurement, 1991 (peer reviewed)
A testlet is an integrated group of test items presented as a unit. The concept of testlet differential item functioning (testlet DIF) is defined, and a statistical method is presented to detect testlet DIF. Data from a testlet-based experimental version of the Scholastic Aptitude Test illustrate the methodology. (SLD)
Descriptors: College Entrance Examinations, Definitions, Graphs, Item Bias
Samejima, Fumiko – Psychometrika, 1993 (peer reviewed)
An approximation for the bias function of the maximum likelihood estimate of the latent trait or ability is developed for the general case where item responses are discrete, which includes the dichotomous response level, the graded response level, and the nominal response level. (SLD)
Descriptors: Ability, Equations (Mathematics), Estimation (Mathematics), Item Response Theory