Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 4 |
Descriptor
| Comparative Analysis | 24 |
| Testing Problems | 24 |
| Scoring | 18 |
| Test Construction | 7 |
| Guessing (Tests) | 6 |
| Higher Education | 6 |
| Computer Assisted Testing | 5 |
| Educational Assessment | 5 |
| Multiple Choice Tests | 5 |
| Test Format | 5 |
| Test Interpretation | 5 |
Education Level
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Secondary Education | 1 |
Audience
| Practitioners | 1 |
| Teachers | 1 |
Location
| Iran | 1 |
| Netherlands | 1 |
Assessments and Surveys
| National Assessment of… | 2 |
| Comprehensive Tests of Basic… | 1 |
| Graduate Management Admission… | 1 |
| Michigan Test of English… | 1 |
| Program for International… | 1 |
| SAT (College Admission Test) | 1 |
| Wechsler Preschool and… | 1 |
Emadian, Farzaneh; Gholami, Javad; Sarkhosh, Mehdi – Journal of Teacher Education for Sustainability, 2018
The first and most crucial step towards developing a sustainable curriculum for instructors teaching English for Specific Academic Purposes (ESAP) is a needs analysis. The main aim of this study was therefore to investigate the in-service needs of language instructors and content specialists teaching ESAP and to identify the differences…
Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Inservice Teacher Education
Chen, Haiwen H.; von Davier, Matthias; Yamamoto, Kentaro; Kong, Nan – ETS Research Report Series, 2015
One major issue with large-scale assessments is that respondents may leave many items unanswered, resulting in less accurate estimates of both assessed abilities and item parameters. This report studies how item type affects item-level nonresponse rates and how different methods of treating item-level nonresponses have an…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2011
The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method against the oral examination (OE) method. MCQs are widely used and their importance seems likely to grow, due to their inherent suitability for electronic assessment. However, MCQs are influenced by the tendency of examinees to guess…
Descriptors: Grades (Scholastic), Scoring, Multiple Choice Tests, Test Format
Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2010
The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method with examination based on constructed-response questions (CRQs). Although MCQs have an advantage concerning objectivity in the grading process and speed in producing results, they also introduce an error in the final…
Descriptors: Computer Assisted Instruction, Scoring, Grading, Comparative Analysis
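Both Ventouras et al. studies hinge on the scoring error that blind guessing introduces into number-right MCQ scores. A minimal sketch of that inflation, under the simplifying assumption of purely random guessing on unknown items (the item and option counts below are invented, not taken from either study):

```python
# Illustrative only: how blind guessing inflates a number-right MCQ score.
# Item counts and option counts are invented, not taken from the studies.

def expected_number_right(n_items, n_known, n_options):
    """Expected number-right score when every unknown item is guessed at random."""
    n_guessed = n_items - n_known
    return n_known + n_guessed / n_options  # each blind guess succeeds with prob. 1/k

if __name__ == "__main__":
    score = expected_number_right(n_items=50, n_known=30, n_options=4)
    print(score)  # 35.0 -> five points above the examinee's true knowledge of 30 items
```

Under number-right scoring the expected inflation is n_guessed / k points per examinee; constructed-response and oral formats avoid this component, which is the comparison both studies set up.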
Larkin, Kevin C.; Weiss, David J. – 1975
A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs
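The pyramidal design in the Larkin and Weiss study is an up-one/down-one branching scheme: after each response the examinee is routed to a slightly harder item if correct and a slightly easier one if incorrect. A minimal sketch of that routing logic, assuming an arbitrary difficulty scale and a simplified terminal-difficulty score rather than the 1975 operational scoring:

```python
# Simplified sketch of up-one/down-one pyramidal branching (not the 1975 item pool).
# Difficulty is tracked on an arbitrary scale; a real pyramidal test draws items
# from a fixed triangular arrangement of pre-calibrated items.

def pyramidal_test(answers_item, n_stages=15, start_difficulty=0.0, step=0.5):
    """Administer n_stages items, branching up after a correct response, down after an error.

    `answers_item` is a callable taking the current item difficulty and
    returning True (correct) or False (incorrect).
    """
    difficulty = start_difficulty
    administered = []
    for _ in range(n_stages):
        administered.append(difficulty)
        if answers_item(difficulty):
            difficulty += step   # correct -> branch to a harder item
        else:
            difficulty -= step   # incorrect -> branch to an easier item
    # One common pyramidal score: the difficulty level the examinee ends on.
    return difficulty, administered

if __name__ == "__main__":
    # Hypothetical deterministic examinee who answers items below difficulty 1.5 correctly.
    final, path = pyramidal_test(lambda d: d < 1.5)
    print(final)  # terminal difficulty, used here as the score
    print(path)   # the 15 difficulties administered
```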
Peer reviewed: van der Linden, Wim J. – Review of Educational Research, 1981
Using criterion-referenced test item data collected in an empirical study, differences in item selection between Cox and Vargas' pretest-posttest validity index and a latent trait approach (evaluation of the item information function for the mastery score) are analyzed. (Author/GK)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Foreign Countries, Latent Trait Theory
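The latent trait approach van der Linden contrasts with the Cox and Vargas index selects items by evaluating their information functions at the mastery score. A minimal sketch under a two-parameter logistic model, with invented item parameters and cutoff (the review itself works from empirical criterion-referenced data):

```python
import math

# Item information under a two-parameter logistic (2PL) IRT model:
#   P(theta) = 1 / (1 + exp(-a * (theta - b)))
#   I(theta) = a^2 * P(theta) * (1 - P(theta))
# The mastery cutoff and item parameters below are invented.

def information_2pl(theta, a, b):
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

if __name__ == "__main__":
    mastery_theta = 0.5                 # hypothetical mastery score on the theta scale
    item_bank = {                       # hypothetical (discrimination a, difficulty b) pairs
        "item_1": (1.8, 0.4),
        "item_2": (1.2, -0.5),
        "item_3": (0.9, 1.6),
    }
    # The latent trait approach ranks items by the information they supply at the cutoff.
    ranked = sorted(item_bank.items(),
                    key=lambda kv: information_2pl(mastery_theta, *kv[1]),
                    reverse=True)
    for name, (a, b) in ranked:
        print(name, round(information_2pl(mastery_theta, a, b), 3))
```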
Peer reviewed: Lord, Frederic M. – Educational and Psychological Measurement, 1971
A number of empirical studies are suggested to answer certain questions in connection with flexilevel tests. (MS)
Descriptors: Comparative Analysis, Difficulty Level, Guessing (Tests), Item Analysis
Peer reviewed: Koehler, Roger A. – Journal of Educational Measurement, 1971
Descriptors: Achievement Tests, Comparative Analysis, Confidence Testing, Grade 11
Schnipke, Deborah L.; Reese, Lynda M. – 1997
Two-stage and multistage test designs provide a way of roughly adapting item difficulty to test-taker ability. All test takers take a parallel stage-one test, and, based on their scores, they are routed to tests of different difficulty levels in subsequent stages. These designs provide some of the benefits of standard computerized adaptive testing…
Descriptors: Ability, Adaptive Testing, Algorithms, Comparative Analysis
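The routing step in the Schnipke and Reese designs can be reduced to a simple rule: a common stage-one test, then assignment to a second-stage form whose difficulty matches the stage-one score. The cut scores and form labels below are placeholders, not the values used in their designs:

```python
# Minimal sketch of two-stage routing: every examinee takes the same stage-one
# test, and the stage-one number-right score selects the second-stage form.
# Cut scores and form labels are hypothetical.

def route(stage_one_score, cuts=(10, 20)):
    """Return the second-stage form for a given stage-one score."""
    low, high = cuts
    if stage_one_score < low:
        return "easy form"
    if stage_one_score < high:
        return "medium form"
    return "hard form"

if __name__ == "__main__":
    for score in (6, 14, 27):
        print(score, "->", route(score))
```

A multistage design repeats this routing after each stage using the accumulated score, which is how these designs approximate item-level adaptive testing with far fewer routing decisions.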
Peer reviewed: Green, Bert F. – Educational Measurement: Issues and Practice, 1995
If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)
Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores
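One concrete instance of the equating problem Green raises is linear (mean-sigma) equating, which places scores from one form on the scale of another so that the two distributions share a mean and standard deviation; it is defensible only when, as the abstract stresses, both forms measure the same construct. The scores below are invented for illustration:

```python
from statistics import mean, pstdev

# Linear (mean-sigma) equating: place form-X scores on the form-Y scale so that
# the two score distributions share a mean and standard deviation.

def linear_equate(x_score, x_scores, y_scores):
    mx, sx = mean(x_scores), pstdev(x_scores)
    my, sy = mean(y_scores), pstdev(y_scores)
    return my + (sy / sx) * (x_score - mx)

if __name__ == "__main__":
    this_year = [12, 15, 18, 20, 22, 25]   # hypothetical scores on this year's assessment
    last_year = [14, 18, 21, 24, 26, 29]   # hypothetical scores on last year's assessment
    # A score of 18 on this year's form, expressed on last year's scale.
    print(round(linear_equate(18, this_year, last_year), 2))
```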
Angoff, William H.; Schrader, William B. – 1982
In a study to determine whether a shift from Formula scoring to Rights scoring can be made without causing a discontinuity in the test scale, the analysis of special administrations of the Scholastic Aptitude Test and Chemistry Achievement Test and the variable section of an operational form of the Graduate Management Admission Test (GMAT) is…
Descriptors: Comparative Analysis, Equated Scores, Guessing (Tests), Higher Education
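The scale question Angoff and Schrader study follows from the two scoring rules themselves: rights scoring counts correct answers, while formula scoring subtracts W/(k-1) for the W wrong answers on k-option items. When no items are omitted the two scores are linearly related, which the following sketch makes explicit (test length and option count are illustrative):

```python
# Rights vs. formula scoring on a test of N k-option items (numbers are illustrative).
#   rights score:  R  (number correct)
#   formula score: FS = R - W / (k - 1), where W is the number wrong
# With no omitted items (W = N - R) the two scores are linearly related,
# so switching rules is a rescaling; omissions are what complicate the shift.

def formula_score(rights, wrongs, k):
    return rights - wrongs / (k - 1)

def rights_from_formula(fs, n_items, k):
    """Invert the relation under the no-omission assumption W = N - R."""
    return (fs * (k - 1) + n_items) / k

if __name__ == "__main__":
    n_items, k = 100, 5
    rights, wrongs = 68, 32                      # no omissions: 68 + 32 = 100
    fs = formula_score(rights, wrongs, k)
    print(fs)                                    # 60.0
    print(rights_from_formula(fs, n_items, k))   # 68.0, recovering the rights score
```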
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large-scale testing programs. Although there has been strong support from curriculum and policy stakeholders, the technical feasibility of large-scale performance assessments has remained in question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
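A central quantity in the generalizability-theory framework this report draws on is the generalizability coefficient: the ratio of person (universe-score) variance to person variance plus error variance, with the error shrinking as scores are averaged over more raters or tasks. A minimal sketch for a persons-by-raters design, using invented variance components:

```python
# Generalizability coefficient for a persons x raters design.
# The variance components below are invented; in practice they are estimated
# from the observed ratings (e.g., via a random-effects ANOVA).

def g_coefficient(var_person, var_residual, n_raters):
    """Relative (norm-referenced) generalizability coefficient for n_raters raters."""
    relative_error = var_residual / n_raters
    return var_person / (var_person + relative_error)

if __name__ == "__main__":
    var_person, var_residual = 0.50, 0.30        # hypothetical variance components
    for n_raters in (1, 2, 4):
        print(n_raters, "rater(s):",
              round(g_coefficient(var_person, var_residual, n_raters), 3))
```

Averaging over more raters (or tasks) shrinks the error term, which is the lever large-scale performance assessments rely on to reach acceptable score generalizability.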
Crews, William E., Jr. – 1991
As part of a study of teacher evaluation of student replies to open-ended questions, a second question--the best method of determining interrater reliability--was examined. The standard method, the Pearson Product-Moment correlation, overestimated the degree of match between researchers' and teachers' scoring of tests. The simpler percent…
Descriptors: Comparative Analysis, Elementary School Teachers, Evaluation Methods, Evaluators
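Crews's finding that the Pearson correlation overstates rater agreement is easy to reproduce with a toy case: if one rater scores every response exactly one point higher than the other, the correlation is perfect even though the two never assign the same score. The ratings below are invented:

```python
from math import sqrt

# Invented ratings: the teacher scores every response one point higher than the
# researcher, so the two raters never actually agree on a score.

researcher = [2, 3, 4, 5, 6, 7]
teacher    = [3, 4, 5, 6, 7, 8]

def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def percent_agreement(x, y):
    return sum(a == b for a, b in zip(x, y)) / len(x)

print(round(pearson(researcher, teacher), 2))            # 1.0 -> perfect correlation
print(round(percent_agreement(researcher, teacher), 2))  # 0.0 -> no exact agreement at all
```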
Cross, Lawrence H. – 1975
A novel scoring procedure was investigated in order to obtain scores from a conventional multiple-choice test that would be free of the guessing component or contain a known guessing component even though examinees were permitted to guess at will. Scores computed with the experimental procedure are based not only on the number of items answered…
Descriptors: Algebra, Comparative Analysis, Guessing (Tests), High Schools
Peer reviewed: Sattler, Jerome M. – Journal of School Psychology, 1976
The study investigated levels of agreement among graduate students (n=14) and school psychologists (n=18) in scoring drawings for the 10 designs on the WPPSI Geometric Design subtest. Considerable scoring disagreement occurred within each group. Results suggest careful study of the WPPSI scoring criteria is needed to achieve scoring proficiency…
Descriptors: Comparative Analysis, Criteria, Criterion Referenced Tests, Elementary Secondary Education
