ERIC - Search Results

Publication Date

In 2026	0
Since 2025	200
Since 2022 (last 5 years)	1070
Since 2017 (last 10 years)	2580
Since 2007 (last 20 years)	4941

Descriptor

Test Items	9533
Test Construction	2717
Foreign Countries	2181
Item Response Theory	1868
Difficulty Level	1620
Item Analysis	1501
Test Validity	1415
Test Reliability	1186
Multiple Choice Tests	1156
Scores	1136
Computer Assisted Testing	1057
Comparative Analysis	1024
Test Format	956
Higher Education	877
Achievement Tests	854
Statistical Analysis	850
Mathematics Tests	845
Psychometrics	832
Test Bias	770
Models	753
Student Evaluation	736
Language Tests	699
Correlation	695
Evaluation Methods	674
Scoring	633
More ▼

Author

van der Linden, Wim J.	69
Tindal, Gerald	50
Hambleton, Ronald K.	45
Alonzo, Julie	41
Chang, Hua-Hua	40
Plake, Barbara S.	40
Sinharay, Sandip	37
Reckase, Mark D.	36
Wainer, Howard	33
Dorans, Neil J.	32
Gierl, Mark J.	30
Sireci, Stephen G.	28
Wang, Wen-Chung	26
Cohen, Allan S.	25
Meijer, Rob R.	25
Samejima, Fumiko	24
Stocking, Martha L.	24
Anderson, Daniel	23
Zwick, Rebecca	23
Veldkamp, Bernard P.	22
Haladyna, Thomas M.	21
Kim, Seock-Ho	21
Wise, Steven L.	21
Kim, Sooyeon	20
More ▼

Education Level

Higher Education	1310
Postsecondary Education	1060
Secondary Education	925
Elementary Education	715
Middle Schools	419
High Schools	362
Elementary Secondary Education	358
Junior High Schools	319
Grade 8	255
Intermediate Grades	209
Grade 4	183
Early Childhood Education	177
Grade 5	134
Primary Education	126
Grade 7	113
Grade 3	111
Grade 6	107
Grade 9	68
Grade 2	56
Grade 10	52
Grade 12	52
Kindergarten	50
Adult Education	39
Grade 11	37
Grade 1	36
More ▼

Audience

Practitioners	653
Teachers	563
Researchers	250
Students	201
Administrators	81
Policymakers	22
Parents	17
Counselors	8
Community	7
Support Staff	3
Media Staff	1
More ▼

Location

Turkey	225
Canada	223
Australia	155
Germany	116
United States	99
China	90
Florida	86
Indonesia	82
Taiwan	78
United Kingdom	73
California	65
Japan	65
Netherlands	64
Iran	62
United Kingdom (England)	57
South Africa	48
Missouri	45
New York	45
Oklahoma	44
South Korea	44
Malaysia	42
Texas	42
Israel	37
Singapore	37
Sweden	37
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4
Does not meet standards	1

Showing 7,816 to 7,830 of 9,533 results Save | Export

The "Unbiased" Anchor: Bridging the Gap between DIF and Item Bias.

Peer reviewed

Williams, Valerie S. L. – Applied Measurement in Education, 1997

Using item response theory to investigate differential item functioning (DIF), students' expected course grades were examined and found to function similarly across sex and race. These grades were incorporated into the matching criterion, enhancing the validity of subgroup comparisons for the third-grade mathematics test taken by 1,050 students.…

Descriptors: Comparative Analysis, Criteria, Elementary School Students, Grade 3

On Agreement of Diagnostic Classifications from Parallel Subtests: Score Reliability at the Micro Level.

Peer reviewed

And Others; Birenbaum, Menucha – Educational and Psychological Measurement, 1997

The agreement of diagnostic classifications from two parallel subtests assessing a mathematics skill with three levels of scoring was studied with 431 Arab Israeli 10th graders. Results indicate that, even when parallel form reliability is high, less agreement is apparent when performance is evaluated at the micro level. (SLD)

Descriptors: Arabs, Classification, Diagnostic Tests, Evaluation Methods

Problems and Issues in Linking Assessments across Languages.

Peer reviewed

Sireci, Stephen G. – Educational Measurement: Issues and Practice, 1997

Different methodologies for linking tests across languages are reviewed and evaluated, focusing on monolingual item response theory, bilingual group designs, and matched monolingual group designs. These methods, although not without weaknesses, are superior for promoting score comparability than methods that rely on translation or expert judgment…

Descriptors: Bilingualism, Comparative Analysis, Cross Cultural Studies, Educational Assessment

Longitudinal Invariance of Self-Esteem and Method Effects Associated with Negatively Worded Items.

Peer reviewed

Motl, Robert W.; DiStefano, Christine – Structural Equation Modeling, 2002

Examined the longitudinal invariance of method effects associated with negatively worded items on a self-report measure of global self-esteem. Data from the National Educational Longitudinal Study for 3,950 junior high school and high school students show that the method effects associated with negatively worded items exhibit invariance across…

Descriptors: High School Students, High Schools, Junior High School Students, Junior High Schools

Written Feedback: Response Certitude and Durability.

Peer reviewed

Kulhavy, Raymond W.; And Others – Contemporary Educational Psychology, 1990

Assumptions of a servocontrol model of test item feedback were tested in a study of 94 junior and senior high school students receiving feedback or no feedback with 2 retention intervals. Results support the assumption that response certitude is related to the learner's ability to comprehend a given item. (SLD)

Descriptors: Comparative Testing, Feedback, High School Students, Junior High School Students

Computerized Test Construction.

Peer reviewed

Vockell, Edward L.; Hall, Jane – Social Studies, 1989

Examines the ways in which computers can assist teachers in developing good tests. Describes the program TESTWORKS in detail and provides charts comparing this program with 11 others in the areas of price, type of questions generated, computer functions, and the usefulness of each. Discusses the use of word processors and databases. (KO)

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Software, Computer Uses in Education

Using Computers to Analyze Item Response Data.

Peer reviewed

Hsu, Tse-chi; Yu, Lifa – Educational Measurement: Issues and Practice, 1989

How computers are used to analyze item data is reviewed, and the information that existing item-analysis programs provide is described. Summaries of studies comparing the performance of some of these packages reveal some of their current limitations. Emphasis is on the usefulness to educational practice of these packages. (SLD)

Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Computer Uses in Education

Gender Differences in Item Performance and Predictive Validity on the DAT Quantitative Reasoning Test.

Peer reviewed

Smith, Richard M.; And Others – Journal of Dental Education, 1989

A study of gender bias in the Dental Admission Test's mathematics test and its validity in predicting dental school success found no significant difference between male and female performance and no significant difference in the predictive validity of items favoring males or females. (Author/MSE)

Descriptors: College Entrance Examinations, Dental Schools, Higher Education, Logical Thinking

Test Scrambling and Student Performance.

Peer reviewed

Gohmann, Stephan F.; Spector, Lee C. – Journal of Economic Education, 1989

Compares the effect of content ordering and scrambled ordering on examinations in courses, such as economics, that require quantitative skills. Empirical results suggest that students do no better if they are given a content-ordered rather than a scrambled examination as student performance is not adversely affected by scrambled ordered…

Descriptors: Cheating, Economics Education, Educational Research, Grading

IRT Ability Estimates from Customized Achievement Tests without Representative Content Sampling.

Peer reviewed

Way, Walter D.; And Others – Applied Measurement in Education, 1989

The effects of using item response theory (IRT) ability estimates based on customized tests formed by selecting areas from a nationally standardized achievement test were examined. For some populations, in some conditions, IRT ability estimates can be equivalent to scores based on full-length tests. (SLD)

Descriptors: Achievement Tests, Adaptive Testing, Content Validity, Elementary Education

A Comparison of Six Methods for Combining Multiple IRT Item Parameter Estimates.

Peer reviewed

McKinley, Robert L. – Journal of Educational Measurement, 1988

Six procedures for combining sets of item response theory (IRT) item parameter estimates from different samples were evaluated using real and simulated response data. Results support use of covariance matrix-weighted averaging and a procedure using sample-size-weighted averaging of estimated item characteristic curves at the center of the ability…

Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Estimation (Mathematics)

A Comparative Item Analysis Study of a Language Testing Instrument.

Peer reviewed

Reynolds, Trudy; And Others – Language Testing, 1994

Presents a study conducted to provide a comparative analysis of five item analysis indices using both IRT and non-IRT indices to describe the characteristics of flagged items and to investigate the appropriateness of logistic regression as an item analysis technique for further studies. The performance of five item analysis indices was examined.…

Descriptors: College Students, Comparative Analysis, English (Second Language), Item Analysis

Content Validation of Key Features on a National Examination of Clinical Decision-Making Skills.

Peer reviewed

Bordage, Georges; And Others – Academic Medicine, 1995

Three related Canadian studies assessed the content validity of 59 clinical problems designed as part of a test of medical decision-making skills. Focus was on the key features, i.e., the critical or essential steps in identification and management of the clinical problem. Results support content validity of the key features. (MSE)

Descriptors: Clinical Teaching (Health Professions), Content Validity, Decision Making, Foreign Countries

Applying the Rasch Model to the Selection of Items for a Mental Ability Test.

Peer reviewed

Korashy, Abdel-Fattah El- – Educational and Psychological Measurement, 1995

The Rasch model was applied to selection of items for an Arabic version of the Otis-Lennon Mental Ability Test using a sample of 599 male and female Kuwaiti secondary school and university students. Results indicated that the test is suitable for the range of ability intended to be measured. (SLD)

Descriptors: Arabic, Cognitive Ability, College Students, Foreign Countries

Assessment and Feedback in Science Education.

Peer reviewed

Black, Paul – Studies in Educational Evaluation, 1995

The role of assessment in science education is explored, focusing on summative assessment in British public certificate examinations. Examples of test items are presented to illustrate difficulties in making valid and reliable assessments, and issues with implications for formative assessment are discussed. (SLD)

Descriptors: Educational Assessment, Feedback, Foreign Countries, Formative Evaluation

« Previous Page | Next Page »

Pages: 1 | ... | 518 | 519 | 520 | 521 | 522 | 523 | 524 | 525 | 526 | ... | 636

Educational and Psychological…	416
Journal of Educational…	359
ProQuest LLC	246
Applied Psychological…	234
Applied Measurement in…	231
ETS Research Report Series	146
Educational Measurement:…	128
Journal of Educational and…	122
Online Submission	115
International Journal of…	105
Grantee Submission	98
Language Testing	93
Psychometrika	93
International Journal of…	79
Journal of Psychoeducational…	72
Educational Assessment	70
Measurement:…	57
Practical Assessment,…	56
Language Assessment Quarterly	55
Journal of Chemical Education	54
Behavioral Research and…	50
Journal of Experimental…	45
Physical Review Physics…	38
Journal of Experimental…	36
International Journal of…	35
More ▼

Journal Articles	5869
Reports - Research	5578
Reports - Evaluative	1556
Speeches/Meeting Papers	1168
Reports - Descriptive	796
Tests/Questionnaires	768
Guides - Classroom - Teacher	472
Guides - Non-Classroom	259
Dissertations/Theses -…	251
Numerical/Quantitative Data	185
Information Analyses	179
Opinion Papers	164
Guides - Classroom - Learner	162
Books	54
Collected Works - General	33
Multilingual/Bilingual…	32
Guides - General	31
Reports - General	21
Book/Product Reviews	20
ERIC Publications	20
Non-Print Media	16
ERIC Digests in Full Text	14
Collected Works - Proceedings	13
Reference Materials - General	13
Collected Works - Serials	12
More ▼

No Child Left Behind Act 2001	36
Individuals with Disabilities…	20
Every Student Succeeds Act…	5
Elementary and Secondary…	4
Race to the Top	4
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
Head Start	3
Americans with Disabilities…	2
Comprehensive Education…	2
Higher Education Act…	2
Immigration Reform and…	2
Civil Rights Act 1964	1
Civil Rights Act 1964 Title…	1
Comprehensive Employment and…	1
Education Consolidation…	1
Education for All Handicapped…	1
Fair Labor Standards Act	1
Higher Education Act Title II	1
Higher Education Opportunity…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Jeanne Clery Disclosure of…	1
Job Training Partnership Act…	1
Kentucky Education Reform Act…	1
More ▼

National Assessment of…	182
Program for International…	178
SAT (College Admission Test)	137
Trends in International…	114
Test of English as a Foreign…	85
Graduate Record Examinations	74
ACT Assessment	44
Advanced Placement…	34
Texas Educational Assessment…	32
Law School Admission Test	30
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	25
Progress in International…	25
Stanford Achievement Tests	24
Raven Progressive Matrices	22
Armed Services Vocational…	20
International English…	20
Peabody Picture Vocabulary…	20
California Achievement Tests	18
Comprehensive Tests of Basic…	18
Test of English for…	17
Metropolitan Achievement Tests	15
General Educational…	14
Graduate Management Admission…	14
Wechsler Adult Intelligence…	13
More ▼