Publication Date
| Publication Date | Records |
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Audience
| Audience | Records |
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Location | Records |
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
What Works Clearinghouse Rating
| Rating | Records |
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewed: Kim, Seock-Ho; Cohen, Allan S. – Applied Psychological Measurement, 1998
Investigated Type I error rates of the likelihood-ratio test for the detection of differential item functioning (DIF) using Monte Carlo simulations under the graded-response model. Type I error rates were within theoretically expected values for all six combinations of sample sizes and ability-matching conditions at each of the nominal alpha…
Descriptors: Ability, Item Bias, Item Response Theory, Monte Carlo Methods
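The likelihood-ratio DIF procedure examined in this study compares two nested IRT fits: a compact model that constrains the studied item's parameters to be equal across groups and an augmented model that lets them differ. A minimal sketch of the test statistic is below; the log-likelihoods and parameter counts are placeholders, not values from the study, and a real analysis would obtain them from IRT estimation software.

```python
# Hedged sketch: likelihood-ratio DIF test from two nested IRT fits.
# The numeric values below are invented placeholders.
from scipy.stats import chi2

loglik_compact = -10234.7    # studied item's parameters constrained equal across groups
loglik_augmented = -10228.1  # studied item's parameters free to differ by group
extra_params = 5             # e.g., graded-response item: 1 slope + 4 category thresholds

g2 = -2.0 * (loglik_compact - loglik_augmented)   # likelihood-ratio statistic
p_value = chi2.sf(g2, df=extra_params)
print(f"G^2 = {g2:.2f}, df = {extra_params}, p = {p_value:.4f}")
# Flag DIF if p < alpha. Repeating this over many simulated no-DIF datasets
# yields the empirical Type I error rate that the study evaluates.
```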
Peer reviewed: O'Neill, Thomas; Lunz, Mary E.; Thiede, Keith – Journal of Applied Measurement, 2000
Studied item exposure in a computerized adaptive test when the item selection algorithm presents examinees with questions they were asked in a previous test administration. Results with 178 repeat examinees on a medical technologists' test indicate that the combined use of an adaptive algorithm to select items and latent trait theory to estimate…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Item Response Theory
Peer reviewed: Dassa, Clement; Lambert, Jean; Blais, Regis; Potvin, Diane; Gauthier, Natalie – Canadian Journal of Program Evaluation/La Revue canadienne d'evaluation de programme, 1997
Whether a middle alternative in the response choices to a questionnaire influences the reliability and validity of survey responses was studied with 1,390 physicians, nurses, and midwives. Including a neutral option had little effect on overall reliability and validity, but allowed better coherence when items were considered globally. (SLD)
Descriptors: Attitude Measures, Nurses, Obstetrics, Opinions
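The "reliability" question here is typically checked with an internal-consistency index such as Cronbach's alpha, computed with and without the neutral middle option. A minimal sketch, using simulated Likert-type data rather than anything from the study:

```python
# Hedged sketch: Cronbach's alpha for a Likert scale (simulated data only).
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """items: respondents x items matrix of scored responses."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1.0 - item_vars / total_var)

rng = np.random.default_rng(0)
trait = rng.normal(size=(200, 1))                       # shared latent attitude
noise = rng.normal(scale=0.8, size=(200, 10))
scores_5pt = np.clip(np.rint(3 + trait + noise), 1, 5)  # 1-5 scale with a neutral midpoint
print(round(cronbach_alpha(scores_5pt), 3))
```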
Peer reviewed: Bennett, Randy Elliot; Morley, Mary; Quardt, Dennis – Applied Psychological Measurement, 2000
Describes three open-ended response types that could broaden the conception of mathematical problem solving used in computerized admissions tests: (1) mathematical expression (ME); (2) generating examples (GE); and (3) graphical modeling (GM). Illustrates how combining ME, GE, and GM can form extended constructed response problems. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Constructed Response, Mathematics Tests
Peer reviewed: Rushton, J. Philippe; Skuy, Mervyn – Intelligence, 2000
Administered untimed Raven's Standard Progressive Matrices (SPM) to 173 African and 136 White college students in South Africa. In comparison with the 1993 U.S. normative sample, African students scored at the 14th percentile, and White students at the 61st percentile. Differences were greater on SPM items with the highest item total correlations,…
Descriptors: Black Students, College Students, Correlation, Foreign Countries
Peer reviewed: Reise, Steven P.; Flannery, Wm. Peter – Applied Measurement in Education, 1996
Discusses statistical and theoretical issues that arise in assessing person-fit on measures of typical performance, including the frequently attenuated detection of person misfit, the need for methods that identify sources of response aberrancy, and the use of person-fit measures as moderators of trait-criterion relations. (SLD)
Descriptors: Item Response Theory, Measurement Techniques, Performance, Responses
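A common person-fit index in this literature is the standardized log-likelihood statistic l_z, which compares an examinee's observed response-pattern likelihood with its expectation under the IRT model. The sketch below assumes dichotomous items and that model-implied probabilities at the examinee's estimated trait level are already available; the example vectors are invented.

```python
# Hedged sketch: l_z person-fit statistic for dichotomous responses.
import numpy as np

def lz_person_fit(responses: np.ndarray, probs: np.ndarray) -> float:
    """responses: 0/1 vector; probs: model-implied P(correct) per item."""
    p, u = probs, responses
    l0 = np.sum(u * np.log(p) + (1 - u) * np.log(1 - p))          # observed log-likelihood
    expected = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))    # its expectation
    variance = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)     # its variance
    return (l0 - expected) / np.sqrt(variance)

u = np.array([1, 1, 0, 1, 0, 0, 1, 0])
p = np.array([0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2])
print(round(lz_person_fit(u, p), 3))  # large negative values suggest aberrant responding
```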
Peer reviewed: Kim, Mikyung – Language Testing, 2001
Investigates differential item functioning (DIF) across two different broad language groupings, Asian and European, in a speaking test in which the test takers' responses were rated polytomously. Data were collected from 1038 nonnative speakers of English from France, Hong Kong, Japan, Spain, Switzerland, and Thailand who took the SPEAK test in…
Descriptors: English (Second Language), Foreign Countries, Item Analysis, Language Tests
Peer reviewed: Flannelly, Laura T. – Journal of Nursing Education, 2001
Between administrations of a test, 36 nursing students were given a practice test and answer key that provided feedback; 30 were not. Those who performed poorly on the test were more overconfident about answers to hard questions. This judgment bias can be reduced by providing feedback about their performance and confidence. (Contains 54…
Descriptors: Bias, Feedback, Higher Education, Nursing Education
Peer reviewed: Alderson, J. Charles; Percsich, Richard; Szabo, Gabor – Language Testing, 2000
Reports on the potential problems in scoring responses to sequencing tests, the development of a computer program to overcome these difficulties, and an exploration of the value of scoring procedures. (Author/VWL)
Descriptors: Computer Software, Foreign Countries, Item Analysis, Language Tests
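The scoring difficulty with sequencing items is that a response can be "almost right" in several different ways. The sketch below contrasts two simple scoring rules; these are illustrative only and are not necessarily the procedures implemented in the authors' program.

```python
# Hedged sketch: two scoring rules for a sequencing item (e.g., ordering sentences).
def exact_position_score(response: list[str], key: list[str]) -> int:
    """Credit only for elements placed in exactly the right slot."""
    return sum(r == k for r, k in zip(response, key))

def adjacent_pair_score(response: list[str], key: list[str]) -> int:
    """Credit for each adjacent pair that also appears adjacently in the key."""
    key_pairs = set(zip(key, key[1:]))
    return sum(pair in key_pairs for pair in zip(response, response[1:]))

key = ["A", "B", "C", "D", "E"]
resp = ["B", "C", "D", "E", "A"]          # everything shifted by one position
print(exact_position_score(resp, key))    # 0 -- harsh on a near-correct ordering
print(adjacent_pair_score(resp, key))     # 3 -- rewards preserved local order
```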
Peer reviewed: Parke, Carol S. – Educational Assessment, 2001
Discusses an approach to analyzing performance assessments that identifies potential reasons for misfitting items and uses this information to improve on items and rubrics for these assessments. Illustrates the approach through a 53-item mathematics performance assessment completed by approximately 500 middle school students. (SLD)
Descriptors: Goodness of Fit, Mathematics Tests, Middle School Students, Middle Schools
Peer reviewed: Subkoviak, Michael J.; Kane, Michael T.; Duncan, Patrick H. – Mid-Western Educational Researcher, 2002
Compares Angoff and Nedelsky methods for setting passing scores on tests. Using one of the methods, 84 college students were taught to estimate their probable scores on a vocabulary test. Estimates were compared to their later actual scores. The Nedelsky method was considerably less accurate under certain conditions, and both methods…
Descriptors: Cutting Scores, Difficulty Level, Evaluation Research, Test Construction
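For readers unfamiliar with the two standard-setting methods compared here, the sketch below shows how Angoff and Nedelsky judgments are converted into a passing score. All ratings are invented for illustration and do not come from the study.

```python
# Hedged sketch: turning Angoff and Nedelsky judgments into cut scores.

# Angoff: per item, estimate the probability that a minimally competent
# examinee answers correctly; the cut score is the sum of those probabilities.
angoff_probs = [0.6, 0.8, 0.5, 0.9, 0.7]
angoff_cut = sum(angoff_probs)

# Nedelsky: per multiple-choice item, eliminate the options a minimally
# competent examinee would recognize as wrong; the item's value is
# 1 / (number of options remaining), and the cut score is again the sum.
options_remaining = [2, 4, 2, 1, 3]
nedelsky_cut = sum(1.0 / k for k in options_remaining)

print(f"Angoff cut: {angoff_cut:.2f} out of {len(angoff_probs)} items")
print(f"Nedelsky cut: {nedelsky_cut:.2f} out of {len(options_remaining)} items")
```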
Peer reviewed: van der Linden, Wim J.; Scrams, David J.; Schnipke, Deborah L. – Applied Psychological Measurement, 1999
Proposes an item-selection algorithm for neutralizing the differential effects of time limits on computerized adaptive test scores. Uses a statistical model for distributions of examinees' response times on items in a bank that is updated each time an item is administered. Demonstrates the method using an item bank from the Armed Services…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Item Banks
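The general idea of time-aware item selection is to keep picking informative items while ensuring that predicted response times fit the remaining time budget. The sketch below uses a 2PL information function and a lognormal-style response-time prediction of the kind common in this literature; the selection rule, parameter names, and numbers are all illustrative assumptions, not the authors' algorithm.

```python
# Hedged sketch: time-aware item selection in a CAT (illustrative only).
import math

def info_2pl(a: float, b: float, theta: float) -> float:
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def predicted_time(beta: float, tau: float, sigma: float = 0.4) -> float:
    """Expected seconds under a lognormal response-time model: exp(beta - tau + sigma^2/2)."""
    return math.exp(beta - tau + sigma * sigma / 2.0)

def select_item(bank, theta, tau, time_left, items_left):
    budget_per_item = time_left / items_left
    feasible = [it for it in bank if predicted_time(it["beta"], tau) <= budget_per_item]
    pool = feasible or bank   # fall back to the full bank if nothing fits the budget
    return max(pool, key=lambda it: info_2pl(it["a"], it["b"], theta))

bank = [
    {"a": 1.2, "b": 0.0, "beta": 4.2},   # beta: item time intensity (log-seconds)
    {"a": 0.8, "b": 0.5, "beta": 3.4},
    {"a": 1.5, "b": -0.3, "beta": 4.8},
]
print(select_item(bank, theta=0.2, tau=0.1, time_left=300, items_left=5))
```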
Peer reviewed: Stocking, Martha L. – Applied Psychological Measurement, 1997
Investigated three models that permit restricted examinee control over revising previous answers in the context of adaptive testing, using simulation. Two models permitting item revisions worked well in preserving test fairness and accuracy, and one model may preserve some cognitive processing styles developed by examinees for a linear testing…
Descriptors: Adaptive Testing, Cognitive Processes, Comparative Analysis, Computer Assisted Testing
Peer reviewed: Ryan, Katherine E.; Chiu, Shuwan – Applied Measurement in Education, 2001
Examined whether patterns of gender differential item functioning (DIF) in parcels of items are influenced by changes in item position. Findings for more than 2,000 college freshmen taking a test of mathematics suggest that the amounts of gender DIF and DIF present in item parcels tend not to be influenced by changes in item position. (SLD)
Descriptors: College Freshmen, Context Effect, Higher Education, Item Bias
Hwang, Gwo-Jen; Lin, Bertrand M. T.; Lin, Tsung-Liang – Computers and Education, 2006
A well-constructed test sheet not only helps the instructor evaluate the learning status of the students, but also facilitates the diagnosis of the problems embedded in the students' learning process. This paper addresses the problem of selecting proper test items to compose a test sheet that conforms to such assessment requirements as average…
Descriptors: Test Items, Item Banks, Student Evaluation, Difficulty Level
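The test-sheet composition problem described here is an item-selection optimization over a bank. The paper treats it formally; the sketch below is only a simple greedy heuristic for one constraint (a target average difficulty), with an invented bank, to make the flavor of the problem concrete.

```python
# Hedged sketch: compose a fixed-length test sheet whose average difficulty
# approximates a target, by greedy selection from an item bank (toy heuristic).
def compose_sheet(bank: dict[str, float], length: int, target_avg: float) -> list[str]:
    chosen, total = [], 0.0
    remaining = dict(bank)
    for k in range(1, length + 1):
        # Pick the item that keeps the running average closest to the target.
        best = min(remaining,
                   key=lambda i: abs((total + remaining[i]) / k - target_avg))
        chosen.append(best)
        total += remaining.pop(best)
    return chosen

bank = {"Q1": 0.30, "Q2": 0.55, "Q3": 0.70, "Q4": 0.45, "Q5": 0.90, "Q6": 0.60}
sheet = compose_sheet(bank, length=3, target_avg=0.55)
print(sheet, round(sum(bank[q] for q in sheet) / 3, 3))
```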

