| Publication Date | Records |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
| Audience | Records |
| --- | --- |
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| Location | Records |
| --- | --- |
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| What Works Clearinghouse Rating | Records |
| --- | --- |
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009
On a general level, the objective of diagnostic classification models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…
Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics
Wainer, Howard; Thissen, David – 1992
If examinees are permitted to choose to answer a subset of the questions on a test, just knowing which questions were chosen can provide a measure of proficiency that may be as reliable as would have been obtained from the test graded traditionally. This new method of scoring is much less time consuming and expensive for both the examinee and the…
Descriptors: Adaptive Testing, Cost Effectiveness, Responses, Scoring
Kramer, Gene A. – 1995
The present study is designed to cross-validate the findings of an earlier component analysis of orthographic-projection, spatial-ability items. The earlier research identified four design components that contribute to the difficulty of orthographic-projection items. The research found that increasing Rasch item difficulties on component…
Descriptors: Difficulty Level, Item Response Theory, Spatial Ability, Test Construction
Content Characteristics of GRE Analytical Reasoning Items. GRE Board Professional Report No. 84-14P.
Chalifour, Clark; Powers, Donald E. – 1988
In actual test development practice, the number of test items that must be developed and pretested is typically greater, and sometimes much greater, than the number eventually judged suitable for use in operational test forms. This has proven to be especially true for analytical reasoning items, which currently form the bulk of the analytical…
Descriptors: Coding, Difficulty Level, Higher Education, Test Construction
Dorans, Neil J.; Lawrence, Ida M. – 1988
A procedure for checking the score equivalence of nearly identical editions of a test is described. The procedure employs the standard error of equating (SEE) and utilizes graphical representation of score conversion deviation from the identity function in standard error units. Two illustrations of the procedure involving Scholastic Aptitude Test…
Descriptors: Equated Scores, Error of Measurement, Test Construction, Test Format
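The check described in this abstract can be sketched numerically: express each score conversion's deviation from the identity function in standard-error-of-equating (SEE) units and flag score levels where the standardized deviation is large. All numbers below are hypothetical, and the two-SEE threshold is an illustrative convention, not necessarily the one used in the report.

```python
# Hypothetical sketch of an SEE-based score-equivalence check.
raw_scores = [20, 30, 40, 50, 60]          # scores on the reference edition
converted  = [20.4, 29.6, 40.9, 50.2, 61.5]  # equated scores (hypothetical)
see        = [0.5, 0.4, 0.4, 0.5, 0.6]       # SEE at each score (hypothetical)

# Deviation from the identity conversion, in SEE units.
std_dev_units = [(c - r) / s for r, c, s in zip(raw_scores, converted, see)]

# Flag score levels deviating by more than 2 SEEs from identity.
flagged = [r for r, d in zip(raw_scores, std_dev_units) if abs(d) > 2]
print(flagged)
```

In a graphical version of the procedure, `std_dev_units` would be plotted against `raw_scores` with reference bands at ±2.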
Barnette, J. Jackson – 1997
The controversy regarding reverse or negatively-worded survey stems has been around for several decades. The practice has been used to guard against acquiescent or response set behaviors. A 20-item, 5-point Likert item survey was designed and the stems and response sets were varied in a 2 by 3 design. One independent variable was type of item…
Descriptors: Likert Scales, Reliability, Responses, Statistical Analysis
van der Linden, Wim J.; Vos, Hans J. – 1994
This paper presents some Bayesian theories of simultaneous optimization of decision rules for test-based decisions. Simultaneous decision making arises when an institution has to make a series of selection, placement, or mastery decisions with respect to subjects from a population. An obvious example is the use of individualized instruction in…
Descriptors: Bayesian Statistics, Decision Making, Foreign Countries, Scores
Oshima, T. C.; Davey, T. C. – 1994
This paper evaluated multidimensional linking procedures with which multidimensional test data from two separate calibrations were put on a common scale. Data were simulated with known ability distributions varying on two factors which made linking necessary: mean vector differences and variance-covariance (v-c) matrix differences. After the…
Descriptors: Ability, Estimation (Mathematics), Evaluation Methods, Matrices
Plake, Barbara S.; Giraud, Gerald – 1998
In the traditional Angoff Standard Setting Method, experts are instructed to predict the possibility that a randomly selected, hypothetical minimally competent candidate will be able to answer each multiple choice question in the test correctly. These item performance estimates are averaged across panelists and aggregated to determine the minimum…
Descriptors: Estimation (Mathematics), Evaluators, Performance Factors, Standard Setting (Scoring)
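The aggregation step of the traditional Angoff method described in this abstract can be sketched as follows: panelists' per-item probability estimates are averaged across panelists, and the item means are summed to yield the minimum passing score. The ratings below are hypothetical.

```python
# Hypothetical Angoff ratings: per panelist, the estimated probability that
# a minimally competent candidate answers each of 4 items correctly.
ratings = {
    "panelist_1": [0.6, 0.8, 0.5, 0.9],
    "panelist_2": [0.5, 0.7, 0.6, 0.8],
    "panelist_3": [0.7, 0.9, 0.4, 0.7],
}
n_items = 4

# Average each item's estimates across panelists.
item_means = [
    sum(r[i] for r in ratings.values()) / len(ratings) for i in range(n_items)
]

# Sum item means to obtain the cut (minimum passing) score.
cut_score = sum(item_means)
print(round(cut_score, 2))  # 2.7 correct answers out of 4
```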
Lee, Guemin; Kolen, Michael J.; Frisbie, David A.; Ankenmann, Robert D. – 1998
Item response models can be applied in many test equating situations by making strong statistical assumptions. Thus, studying the robustness of the models to violations of the assumptions and investigating model-data fit are essential in all item response theory (IRT) equating applications (M. Kolen and R. Brennan, 1995). Previous studies dealing…
Descriptors: Equated Scores, Item Response Theory, Robustness (Statistics), Tables (Data)
Cohen, Allan S.; Kim, Seock-Ho; Wollack, James A. – 1998
This paper provides a review of procedures for detection of differential item functioning (DIF) for item response theory (IRT) and observed score methods for the graded response model. In addition, data from a test anxiety scale were analyzed to examine the congruence among these procedures. Data from Nasser, Takahashi, and Benson (1997) were…
Descriptors: Identification, Item Bias, Item Response Theory, Scores
Spray, Judith A.; And Others – 1990
Test data generated according to two different multidimensional item response theory (IRT) models were compared at both the item response level and the test score level to determine whether measurable differences between the models could be detected when the data sets were constrained to be equivalent in terms of item "p"-values. The…
Descriptors: Ability, Comparative Analysis, Item Response Theory, Mathematical Models
Zhang, Jinming; Chang, Hua-Hua – ETS Research Report Series, 2005
This paper compares the use of multiple pools versus a single pool with respect to test security against large-scale item sharing among some examinees in a computer-based test, under the assumption that a randomized item selection method is used. It characterizes the conditions under which employing multiple pools is better than using a single…
Descriptors: Comparative Analysis, Test Items, Item Banks, Computer Assisted Testing
Baker, Eva; Polin, Linda – 1978
The validity studies planned for the Test Design activities deal primarily with the appropriateness of items generated for a domain. Previous exploratory work in the field related to overall test content appropriateness ratings has not been satisfactory. Studies which are solely based on correlational data suffer from confounding with…
Descriptors: Questionnaires, Rating Scales, Test Construction, Test Format
Berk, Ronald A. – Educational and Psychological Measurement, 1978
Three formulae developed to correct item-total correlations for spuriousness were evaluated. Relationships among corrected, uncorrected, and item-remainder correlations were determined by computing sets of mean, minimum, and maximum deviation coefficients and Spearman rank correlations for nine test lengths. (Author/JKS)
Descriptors: Correlation, Intermediate Grades, Item Analysis, Test Construction
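One widely used part-whole correction of the kind this abstract discusses removes the item's own contribution to the total score before correlating; whether it matches any of the three formulae Berk evaluated is not stated in the snippet, so treat the version below as a generic illustration.

```python
import math

def corrected_item_total(r_it, s_i, s_t):
    """Correct an item-total correlation for spuriousness (part-whole overlap).

    r_it: raw correlation between item score and total score
    s_i:  standard deviation of the item score
    s_t:  standard deviation of the total score (item included)
    Returns the correlation of the item with the total minus the item.
    """
    return (r_it * s_t - s_i) / math.sqrt(s_i**2 + s_t**2 - 2 * r_it * s_i * s_t)

# Example: a raw item-total correlation of .50 shrinks once the
# item's own variance is removed from the total.
print(round(corrected_item_total(0.5, 1.0, 5.0), 4))
```

The correction matters most for short tests, where a single item makes up a large share of the total-score variance.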