ERIC - Search Results

Publication Date

In 2026	0
Since 2025	215
Since 2022 (last 5 years)	1084
Since 2017 (last 10 years)	2594
Since 2007 (last 20 years)	4955

Descriptor

Test Items	9547
Test Construction	2723
Foreign Countries	2184
Item Response Theory	1872
Difficulty Level	1623
Item Analysis	1502
Test Validity	1416
Test Reliability	1187
Multiple Choice Tests	1158
Scores	1137
Computer Assisted Testing	1058
Comparative Analysis	1024
Test Format	956
Higher Education	877
Achievement Tests	855
Statistical Analysis	852
Mathematics Tests	845
Psychometrics	833
Test Bias	772
Models	754
Student Evaluation	736
Language Tests	699
Correlation	695
Evaluation Methods	674
Scoring	633
More ▼

Author

van der Linden, Wim J.	69
Tindal, Gerald	50
Hambleton, Ronald K.	45
Alonzo, Julie	41
Chang, Hua-Hua	40
Plake, Barbara S.	40
Sinharay, Sandip	37
Reckase, Mark D.	36
Wainer, Howard	33
Dorans, Neil J.	32
Gierl, Mark J.	30
Sireci, Stephen G.	28
Wang, Wen-Chung	26
Cohen, Allan S.	25
Meijer, Rob R.	25
Samejima, Fumiko	24
Stocking, Martha L.	24
Anderson, Daniel	23
Zwick, Rebecca	23
Veldkamp, Bernard P.	22
Haladyna, Thomas M.	21
Kim, Seock-Ho	21
Wise, Steven L.	21
Kim, Sooyeon	20
More ▼

Education Level

Higher Education	1314
Postsecondary Education	1064
Secondary Education	927
Elementary Education	716
Middle Schools	420
High Schools	363
Elementary Secondary Education	359
Junior High Schools	320
Grade 8	256
Intermediate Grades	209
Grade 4	183
Early Childhood Education	178
Grade 5	134
Primary Education	126
Grade 7	113
Grade 3	111
Grade 6	107
Grade 9	69
Grade 2	56
Grade 10	53
Grade 12	52
Kindergarten	50
Adult Education	39
Grade 11	38
Grade 1	36
More ▼

Audience

Practitioners	653
Teachers	563
Researchers	250
Students	201
Administrators	81
Policymakers	22
Parents	17
Counselors	8
Community	7
Support Staff	3
Media Staff	1
More ▼

Location

Turkey	226
Canada	223
Australia	155
Germany	116
United States	99
China	90
Florida	86
Indonesia	82
Taiwan	78
United Kingdom	73
California	66
Japan	65
Netherlands	64
Iran	62
United Kingdom (England)	57
South Africa	48
New York	46
Missouri	45
Oklahoma	44
South Korea	44
Malaysia	42
Texas	42
Sweden	38
Israel	37
Singapore	37
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4
Does not meet standards	1

Showing 5,041 to 5,055 of 9,547 results Save | Export

Effects of Item and Response Set Reversals on Survey Statistics.

Download full text

Barnette, J. Jackson – 1997

The controversy regarding reverse or negatively-worded survey stems has been around for several decades. The practice has been used to guard against acquiescent or response set behaviors. A 20-item, 5-point Likert item survey was designed and the stems and response sets were varied in a 2 by 3 design. One independent variable was type of item…

Descriptors: Likert Scales, Reliability, Responses, Statistical Analysis

A Compensatory Approach to Optimal Selection with Mastery Scores. Research Report 94-2.

Download full text

van der Linden, Wim J.; Vos, Hans J. – 1994

This paper presents some Bayesian theories of simultaneous optimization of decision rules for test-based decisions. Simultaneous decision making arises when an institution has to make a series of selection, placement, or mastery decisions with respect to subjects from a population. An obvious example is the use of individualized instruction in…

Descriptors: Bayesian Statistics, Decision Making, Foreign Countries, Scores

Evaluation of Procedures for Linking Multidimensional Item Calibrations.

Download full text

Oshima, T. C.; Davey, T. C. – 1994

This paper evaluated multidimensional linking procedures with which multidimensional test data from two separate calibrations were put on a common scale. Data were simulated with known ability distributions varying on two factors which made linking necessary: mean vector differences and variance-covariance (v-c) matrix differences. After the…

Descriptors: Ability, Estimation (Mathematics), Evaluation Methods, Matrices

Effect of a Modified Angoff Strategy for Obtaining Item Performance Estimates in a Standard Setting Study.

Download full text

Plake, Barbara S.; Giraud, Gerald – 1998

In the traditional Angoff Standard Setting Method, experts are instructed to predict the possibility that a randomly selected, hypothetical minimally competent candidate will be able to answer each multiple choice question in the test correctly. These item performance estimates are averaged across panelists and aggregated to determine the minimum…

Descriptors: Estimation (Mathematics), Evaluators, Performance Factors, Standard Setting (Scoring)

Equating Test Forms Composed of Testlets Using Dichotomous and Polytomous IRT Models.

Download full text

Lee, Guemin; Kolen, Michael J.; Frisbie, David A.; Ankenmann, Robert D. – 1998

Item response models can be applied in many test equating situations by making strong statistical assumptions. Thus, studying the robustness of the models to violations of the assumptions and investigating model-data fit are essential in all item response theory (IRT) equating applications (M. Kolen and R. Brennan, 1995). Previous studies dealing…

Descriptors: Equated Scores, Item Response Theory, Robustness (Statistics), Tables (Data)

A Comparison of Item Response Theory and Observed Score DIF Detection Measures for the Graded Response Model.

Download full text

Cohen, Allan S.; Kim, Seock-Ho; Wollack, James A. – 1998

This paper provides a review of procedures for detection of differential item functioning (DIF) for item response theory (IRT) and observed score methods for the graded response model. In addition, data from a test anxiety scale were analyzed to examine the congruence among these procedures. Data from Nasser, Takahashi, and Benson (1997) were…

Descriptors: Identification, Item Bias, Item Response Theory, Scores

Comparison of Two Logistic Multidimensional Item Response Theory Models. Research Report ONR90-8.

Download full text

Spray, Judith A.; And Others – 1990

Test data generated according to two different multidimensional item response theory (IRT) models were compared at both the item response level and the test score level to determine whether measurable differences between the models could be detected when the data sets were constrained to be equivalent in terms of item "p"-values. The…

Descriptors: Ability, Comparative Analysis, Item Response Theory, Mathematical Models

The Effectiveness of Enhancing Test Security by Using Multiple Item Pools. Research Report. ETS RR-05-19

Peer reviewed
PDF on ERIC

Download full text

Zhang, Jinming; Chang, Hua-Hua – ETS Research Report Series, 2005

This paper compares the use of multiple pools versus a single pool with respect to test security against large-scale item sharing among some examinees in a computer-based test, under the assumption that a randomized item selection method is used. It characterizes the conditions under which employing multiple pools is better than using a single…

Descriptors: Comparative Analysis, Test Items, Item Banks, Computer Assisted Testing

Ongoing Studies in Domain-Referenced Content Validity: First Look at the "Judgment" Issue.

Baker, Eva; Polin, Linda – 1978

The validity studies planned for the Test Design activities deal primarily with the appropriateness of items generated for a domain. Previous exploratory work in the field related to overall test content appropriateness ratings has not been satisfactory. Studies which are solely based on correlational data suffer from confounding with…

Descriptors: Questionnaires, Rating Scales, Test Construction, Test Format

Empirical Evaluation of Formulae for Correction of Item-Total Point-Biserial Correlations.

Peer reviewed

Berk, Ronald A. – Educational and Psychological Measurement, 1978

Three formulae developed to correct item-total correlations for spuriousness were evaluated. Relationships among corrected, uncorrected, and item-remainder correlations were determined by computing sets of mean, minimum, and maximum deviation coefficients and Spearman rank correlations for nine test lengths. (Author/JKS)

Descriptors: Correlation, Intermediate Grades, Item Analysis, Test Construction

Exam Question Exchange.

Peer reviewed

Alexander, John J., Ed. – Journal of Chemical Education, 1978

Two exam questions are presented. One suitable for advanced undergraduate or beginning graduate courses in organic chemistry, is on equivalent expressions for the description of several pericyclic reactions. The second, for general chemistry students, asks for an estimation of the rate of decay of a million-year-old Uranium-238 sample. (BB)

Descriptors: Chemistry, Evaluation, Higher Education, Problem Sets

Handling "Tied Items" When Using Lu's Method of Reliability Estimation

Peer reviewed

Huck, Schuyler W. – Educational and Psychological Measurement, 1978

A modification of Hoyt's analysis of variance model for test analysis was proposed by Lu. A difficulty that may be encountered in using Lu's modification is examined, and a solution is proposed. (JKS)

Descriptors: Analysis of Variance, Difficulty Level, Item Analysis, Test Items

The Problem of Varying Scales for G Index Generalizations

Peer reviewed

Vegelius, Jan – Educational and Psychological Measurement, 1977

Generalizations of the G index as a measure of similarity between persons beyond the dichotomous situation are discussed. An attempt is made to present a generalization that does not require dichotomization of the items for cases where the number of response alternatives may differ. (Author/JKS)

Descriptors: Correlation, Item Analysis, Measurement Techniques, Multidimensional Scaling

Demonstrating the Utility of the Standardization Approach to Assessing Unexpected Differential Item Performance on the Scholastic Aptitude Test.

Peer reviewed

Dorans, Neil J.; Kulick, Edward – Journal of Educational Measurement, 1986

The standardization method for assessing unexpected differential item performance or differential item functioning is introduced. Findings of five studies are summarized, in which the statistical method of standardization is used to look for unexpected differences in item performance across different subpopulations of the Scholastic Aptitude Test.…

Descriptors: Groups, Item Analysis, Sociometric Techniques, Standardized Tests

Unbiased Estimation in a Closed Sequential Testing Procedure.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1983

This article provides unbiased estimates of the proportion of items in an item domain that an examinee would answer correctly if every item were attempted, when a closed sequential testing procedure is used. (Author)

Descriptors: Estimation (Mathematics), Psychometrics, Scores, Sequential Approach

« Previous Page | Next Page »

Pages: 1 | ... | 333 | 334 | 335 | 336 | 337 | 338 | 339 | 340 | 341 | ... | 637

Educational and Psychological…	416
Journal of Educational…	367
ProQuest LLC	246
Applied Psychological…	234
Applied Measurement in…	231
ETS Research Report Series	146
Educational Measurement:…	128
Journal of Educational and…	122
Online Submission	115
International Journal of…	105
Grantee Submission	98
Language Testing	93
Psychometrika	93
International Journal of…	80
Journal of Psychoeducational…	72
Educational Assessment	70
Practical Assessment,…	60
Measurement:…	57
Language Assessment Quarterly	55
Journal of Chemical Education	54
Behavioral Research and…	50
Journal of Experimental…	45
Physical Review Physics…	38
Journal of Experimental…	36
International Journal of…	35
More ▼

Journal Articles	5882
Reports - Research	5592
Reports - Evaluative	1556
Speeches/Meeting Papers	1168
Reports - Descriptive	796
Tests/Questionnaires	768
Guides - Classroom - Teacher	472
Guides - Non-Classroom	259
Dissertations/Theses -…	251
Numerical/Quantitative Data	185
Information Analyses	179
Opinion Papers	164
Guides - Classroom - Learner	162
Books	54
Collected Works - General	33
Multilingual/Bilingual…	32
Guides - General	31
Reports - General	21
Book/Product Reviews	20
ERIC Publications	20
Non-Print Media	16
ERIC Digests in Full Text	14
Collected Works - Proceedings	13
Reference Materials - General	13
Collected Works - Serials	12
More ▼

No Child Left Behind Act 2001	36
Individuals with Disabilities…	20
Every Student Succeeds Act…	5
Elementary and Secondary…	4
Race to the Top	4
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
Head Start	3
Americans with Disabilities…	2
Comprehensive Education…	2
Higher Education Act…	2
Immigration Reform and…	2
Civil Rights Act 1964	1
Civil Rights Act 1964 Title…	1
Comprehensive Employment and…	1
Education Consolidation…	1
Education for All Handicapped…	1
Fair Labor Standards Act	1
Higher Education Act Title II	1
Higher Education Opportunity…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Jeanne Clery Disclosure of…	1
Job Training Partnership Act…	1
Kentucky Education Reform Act…	1
More ▼

National Assessment of…	182
Program for International…	179
SAT (College Admission Test)	137
Trends in International…	114
Test of English as a Foreign…	85
Graduate Record Examinations	74
ACT Assessment	44
Advanced Placement…	34
Texas Educational Assessment…	32
Law School Admission Test	30
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	25
Progress in International…	25
Stanford Achievement Tests	24
Raven Progressive Matrices	22
Armed Services Vocational…	20
International English…	20
Peabody Picture Vocabulary…	20
California Achievement Tests	18
Comprehensive Tests of Basic…	18
Test of English for…	17
Metropolitan Achievement Tests	15
General Educational…	14
Graduate Management Admission…	14
Wechsler Adult Intelligence…	13
More ▼