Peer reviewed: Gerow, Joshua R. – Teaching of Psychology, 1980
Discusses a study to evaluate how test design influences student performance in elementary psychology courses. Findings indicated that the order in which test items appeared on an exam was less significant with regard to student performance than the extent to which test items were well-written and contained some measure of content validity.…
Descriptors: Academic Achievement, Difficulty Level, Higher Education, Psychology
Peer reviewed: Wilcox, Rand R. – Educational and Psychological Measurement, 1979
Wilcox has described three probability models which characterize a single test item in terms of a population of examinees (ED 156 718). This note indicates that similar models can be derived which characterize a single examinee in terms of an item domain. A numerical illustration is given. (Author/JKS)
Descriptors: Achievement Tests, Item Analysis, Mathematical Models, Probability
Peer reviewed: Li, Hsin-Hung; Stout, William – Psychometrika, 1996
A hypothesis testing and estimation procedure, Crossing SIBTEST, is presented for detecting crossing differential item functioning (DIF), which exists when the difference in probabilities of a correct answer for two examinee groups changes signs as ability level is varied. The procedure estimates the matching subtest score at which crossing…
Descriptors: Ability, Estimation (Mathematics), Hypothesis Testing, Item Bias
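Crossing DIF, as described in the abstract above, occurs when the sign of the group difference in correct-response probability changes along the ability scale. A minimal illustration (not SIBTEST itself) using hypothetical two-parameter logistic items whose unequal slopes force the curves to cross:

```python
import numpy as np

def icc_2pl(theta, a, b):
    """Two-parameter logistic item characteristic curve."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Hypothetical item parameters for reference and focal groups;
# the unequal slopes make the two ICCs cross.
theta = np.linspace(-3, 3, 601)
p_ref = icc_2pl(theta, a=1.5, b=0.0)
p_foc = icc_2pl(theta, a=0.8, b=0.2)

# Crossing DIF: the sign of the group difference changes with ability.
diff = p_ref - p_foc
sign_changes = np.where(np.diff(np.sign(diff)) != 0)[0]
crossing_theta = theta[sign_changes]
```

The curves cross where a1(theta - b1) = a2(theta - b2), here at theta = -0.16/0.7, so the focal group is favored below that ability level and disfavored above it; a uniform-DIF statistic that averages over ability can miss this cancellation, which is the problem Crossing SIBTEST targets.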
Peer reviewed: Oshima, T. C.; Raju, Nambury S.; Flowers, Claudia P. – Journal of Educational Measurement, 1997
Defines and demonstrates a framework for studying differential item functioning and differential test functioning for tests that are intended to be multidimensional. The procedure, which is illustrated with simulated data, is an extension of the unidimensional differential functioning of items and tests approach (N. Raju, W. van der Linden, and P.…
Descriptors: Item Bias, Item Response Theory, Models, Simulation
Peer reviewed: Zumbo, Bruno D.; Pope, Gregory A.; Watson, Jackie E.; Hubley, Anita M. – Educational and Psychological Measurement, 1997
E. Roskam's (1985) conjecture that steeper item characteristic curve (ICC) "a" parameters (slopes) (and higher item total correlations in classical test theory) would be found with more concretely worded test items was tested with results from 925 young adults on the Eysenck Personality Questionnaire (H. Eysenck and S. Eysenck, 1975).…
Descriptors: Correlation, Personality Assessment, Personality Measures, Test Interpretation
Peer reviewed: Glas, Cees A. W.; van der Linden, Wim J. – Applied Psychological Measurement, 2003
Developed a multilevel item response theory (IRT) model that allows for differences between the distributions of item parameters of families of item clones. Results from simulation studies based on an item pool from the Law School Admission Test illustrate the accuracy of the item pool calibration and adaptive testing procedures based on the model. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Item Response Theory
Peer reviewed: Yang, Chien-Lin; O'Neill, Thomas R.; Kramer, Gene A. – Journal of Applied Measurement, 2002
Studied item calibration stability in relation to response time and the levels of item difficulty between different response groups on a sample of 389 examinees responding to 6 subtest items of the Perceptual Ability Test of the Dental Admission Test. Results show that scores were equally useful for all groups, and different sources of item…
Descriptors: Ability, College Students, Dentistry, Difficulty Level
Peer reviewed: Walter, Richard A.; Kapes, Jerome T. – Journal of Industrial Teacher Education, 2003
To identify a procedure for establishing cut scores for National Occupational Competency Testing Institute examinations in Pennsylvania, an expert panel assessed written and performance test items for minimally competent workers. Recommendations about the number, type, and training of judges used were made. (Contains 18 references.) (SK)
Descriptors: Cutting Scores, Interrater Reliability, Occupational Tests, Teacher Competency Testing
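The abstract does not name the standard-setting method, but a common panel-based approach to cut scores is an Angoff-style procedure, in which each judge estimates the probability that a minimally competent candidate answers each item correctly and the cut score is the sum of the mean ratings. A hypothetical sketch with invented ratings:

```python
import numpy as np

# Hypothetical Angoff-style ratings: each judge estimates, per item, the
# probability that a minimally competent worker answers correctly.
# Rows = judges, columns = items.
ratings = np.array([
    [0.6, 0.7, 0.5, 0.8],
    [0.5, 0.8, 0.4, 0.9],
    [0.7, 0.6, 0.5, 0.7],
])

# Recommended raw cut score: sum over items of the mean judge rating.
cut_score = ratings.mean(axis=0).sum()
```

Interrater reliability (one of the descriptors below) is typically checked on exactly this judge-by-item matrix before the cut score is adopted.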
Peer reviewed: Wainer, Howard; Lukhele, Robert – Applied Measurement in Education, 1997
The screening for flaws that is routinely carried out for multiple-choice items is often not carried out for large items. Examines continuous item weighting as a way to manage the influence of differential item functioning (DIF). Data from the College Board Advanced Placement History Test are used to illustrate the method. (SLD)
Descriptors: Advanced Placement, College Entrance Examinations, History, Item Bias
Peer reviewed: Douglas, Jeffrey A.; And Others – Journal of Educational and Behavioral Statistics, 1996
A procedure for detection of differential item functioning (DIF) is proposed that amalgamates SIBTEST and kernel-smoothed item response function estimation to assess DIF as a function of the latent trait theta that the test is designed to measure. Smoothed SIBTEST is studied through simulation and real data analysis. (SLD)
Descriptors: Ability, Equations (Mathematics), Estimation (Mathematics), Item Bias
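The kernel-smoothed item response function estimation mentioned above can be illustrated with a generic Nadaraya-Watson estimator (a sketch of the idea, not the authors' exact procedure): the estimated probability of a correct answer at each ability level is a locally weighted proportion correct.

```python
import numpy as np

def kernel_smooth_irf(theta_hat, correct, grid, bandwidth=0.4):
    """Nadaraya-Watson (Gaussian-kernel) estimate of an item response
    function: locally weighted proportion correct at each grid point."""
    z = (grid[:, None] - theta_hat[None, :]) / bandwidth
    w = np.exp(-0.5 * z ** 2)  # kernel weights, examinees near each grid point
    return (w * correct[None, :]).sum(axis=1) / w.sum(axis=1)

# Hypothetical examinees: ability estimates and 0/1 item scores.
theta_hat = np.linspace(-2.0, 2.0, 9)
correct = (theta_hat > 0).astype(float)
grid = np.array([-2.0, 0.0, 2.0])
irf = kernel_smooth_irf(theta_hat, correct, grid)
```

Estimating such curves separately for reference and focal groups and comparing them as functions of theta is what lets a smoothed procedure display DIF across the whole ability range rather than as a single summary number.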
Peer reviewed: Nasser, Fadia; Takahashi, Tomone – Applied Measurement in Education, 2003
Examined the impact of using item parcels on ad hoc goodness-of-fit indexes in confirmatory factor analysis using the Arabic version of Sarason's Reactions to Tests scale. Data from 421 and 372 Arabic-speaking students at an Israeli high school show that lower skewness and kurtosis and higher validity occur for parcels than for individual items.…
Descriptors: Arabic, Foreign Countries, Goodness of Fit, High School Students
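Item parcels are simply small sums of individual items that are then analyzed as units in the factor model. A toy sketch of the mechanics only (the dataset is invented, and the abstract's skewness comparison is not reproduced here):

```python
import numpy as np

# Hypothetical 0-4 Likert responses: 8 persons x 6 items (deterministic toy data).
responses = np.arange(48).reshape(8, 6) % 5

# Form three 2-item parcels per person by summing adjacent items;
# the parcel scores, not the item scores, enter the factor analysis.
parcels = responses.reshape(8, 3, 2).sum(axis=2)
```

Because each parcel aggregates several items, its score distribution is closer to continuous and typically less skewed than any single item's, which is why parcel-level confirmatory factor analysis tends to show better ad hoc fit.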
Peer reviewed: Zumbo, Bruno D. – Language Testing, 2003
Based on the observation that scale-level methods are sometimes exclusively used to investigate measurement invariance for test translation, describes results of a simulation study investigating whether item-level differential item functioning (DIF) manifests itself in scale-level analyses such as single and multigroup factor analyses and per…
Descriptors: Factor Analysis, Item Analysis, Language Tests, Second Language Learning
Peer reviewed: Frantom, Catherine; Green, Kathy E.; Lam, Tony C. M. – Journal of Applied Measurement, 2002
Studied the effects of item grouping on local independence and item invariance, the characteristics of items scaled under the Rasch model that make them sample-free. Data were 107 responses to a survey of teachers' opinions about the Ontario grade 9 literacy test. Although effects of grouping and item phrasing on invariance were found, results…
Descriptors: Attitude Measures, Attitudes, Foreign Countries, Groups
Peer reviewed: Sunathong, Surintorn; Schumacker, Randall E.; Beyerlein, Michael M. – Journal of Applied Measurement, 2000
Studied five factors that can affect the equating of scores from two tests onto a common score scale through the simulation and equating of 4,860 item data sets. Findings indicate three statistically significant two-way interactions for common item length and test length, item difficulty standard deviation and item distribution type, and item…
Descriptors: Difficulty Level, Equated Scores, Interaction, Item Response Theory
Peer reviewed: Baker, Frank B. – Applied Psychological Measurement, 1990
Simulation was used to study the equating of results from the PC-BILOG computer program to an underlying metric under a two-parameter item response theory model. Results are discussed in terms of the identification problem and implications for test equating. (SLD)
Descriptors: Bayesian Statistics, Computer Simulation, Equated Scores, Item Response Theory
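The identification problem in the last two abstracts arises because separately calibrated IRT parameters are only determined up to a linear transformation of the ability metric; placing two forms on a common scale is often done with linking constants. A minimal mean/sigma sketch with invented difficulty values (not the PC-BILOG procedure itself):

```python
import numpy as np

def mean_sigma_link(b_new, b_ref):
    """Mean/sigma linking constants A, B that map the new-form metric
    onto the reference metric: b* = A * b_new + B.
    Slopes transform inversely: a* = a_new / A."""
    A = np.std(b_ref, ddof=1) / np.std(b_new, ddof=1)
    B = np.mean(b_ref) - A * np.mean(b_new)
    return A, B

# Hypothetical difficulty estimates for the same common items,
# calibrated separately on two forms (the new form's metric is shifted).
b_ref = np.array([-1.2, -0.4, 0.1, 0.8, 1.5])
b_new = np.array([-1.0, -0.2, 0.3, 1.0, 1.7])

A, B = mean_sigma_link(b_new, b_ref)
b_transformed = A * b_new + B
```

Here the invented new-form difficulties are the reference values shifted by 0.2, so the recovered constants are A = 1 and B = -0.2 and the transformed values land back on the reference metric; with real estimates the constants absorb both a scale and a location difference.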


