ERIC - Search Results

Showing all 15 results

A Comparative Study of AI-Human-Made and Human-Made Test Forms for a University TESOL Theory Course

Peer reviewed

Kyung-Mi O. – Language Testing in Asia, 2024

This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…

Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items

Item-Reduction Methodologies for Complex Educational Assessments: A Comparative Methodological Exploration

Lance M. Kruse – ProQuest LLC, 2019

This study explores six item-reduction methodologies used to shorten an existing complex problem-solving non-objective test by evaluating how each shortened form performs across three sources of validity evidence (i.e., test content, internal structure, and relationships with other variables). Two concerns prompted the development of the present…

Descriptors: Educational Assessment, Comparative Analysis, Test Format, Test Length

A Diagnostic Study of Computer Application of Structural Communication Grid

Bahar, Mehmet; Aydin, Fatih; Karakirik, Erol – Online Submission, 2009

In this article, the structural communication grid (SCG), an alternative measurement and evaluation technique, is first summarised, and the design, development and implementation of a computer-based SCG system are introduced. The system is then tested on a sample of 154 participants consisting of candidate students, science teachers and…

Descriptors: Educational Technology, Technology Integration, Evaluation Methods, Measurement Techniques

Prior Degree and Student Assessment Performance: How Can Evidence Guide Decisions on Assessment Policy?

Peer reviewed

Craig, Pippa; Gordon, Jill; Clarke, Rufus; Oldmeadow, Wendy – Assessment & Evaluation in Higher Education, 2009

This study aimed to provide evidence to guide decisions on the type and timing of assessments in a graduate medical programme, by identifying whether students from particular degree backgrounds face greater difficulty in satisfying the current assessment requirements. We examined the performance rank of students in three types of assessments and…

Descriptors: Student Evaluation, Medical Education, Student Characteristics, Correlation

A Comparison of Direct and Indirect Writing Assessment Methods.

Peer reviewed

Stiggins, Richard J. – Research in the Teaching of English, 1982

Compares direct and indirect writing assessment strategies and contrasts them in terms of the relationship each has to specific classroom decision-making situations, the components of writing assessed, practical testing matters, characteristics of test exercises, test scoring procedures, and procedures for determining test quality. (HOD)

Descriptors: Comparative Analysis, Decision Making, Educational Assessment, Test Format

Comparability of Scores from Performance Assessments.

Peer reviewed

Green, Bert F. – Educational Measurement: Issues and Practice, 1995

If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)

Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores

Cognitive Complexity and the Comparability of Multiple-Choice and Constructed-Response Test Formats.

Peer reviewed

Hancock, Gregory R. – Journal of Experimental Education, 1994

To investigate the ability of multiple-choice tests to assess higher order thinking skills, examinations were constructed as half multiple choice and half constructed response. Results with 90 undergraduate and graduate students indicate that the 2 formats measure similar constructs at different levels of complexity. (SLD)

Descriptors: Cognitive Processes, Comparative Analysis, Constructed Response, Educational Assessment

Problems and Issues in Linking Assessments across Languages.

Peer reviewed

Sireci, Stephen G. – Educational Measurement: Issues and Practice, 1997

Different methodologies for linking tests across languages are reviewed and evaluated, focusing on monolingual item response theory, bilingual group designs, and matched monolingual group designs. These methods, although not without weaknesses, are superior for promoting score comparability than methods that rely on translation or expert judgment…

Descriptors: Bilingualism, Comparative Analysis, Cross Cultural Studies, Educational Assessment

An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

Wainer, Howard; And Others – 1990

The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing

Characteristics Which Differentiate Criterion-Referenced from Norm-Referenced Tests.

Klein, Thomas W. – 1990

Characteristics that distinguish criterion-referenced tests from their norm-referenced counterparts are discussed, including: the purposes that they are designed to serve; the characteristics of the types of items that they contain; and the manner in which they are developed. More specifically, the distinguishing characteristics include: reference…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Differences, Educational Assessment

Toward an Operational Definition of Educational Performance Assessments.

Finch, F. L.; Dost, Marcia A. – 1992

Many state and local entities are developing and using performance assessment programs. Because these initiatives are so diverse, it is very difficult to understand what they are doing, or to compare them in any meaningful way. Multiple-choice tests are contrasted with performance assessments, and preliminary classifications are suggested to…

Descriptors: Alternative Assessment, Classification, Comparative Analysis, Constructed Response

Some Issues in Free Response Testing.

Pollack, Judith M. – 1990

This paper summarizes an investigation of applications and issues in free response (FR) testing during 1989. It draws on ideas from the results of the National Educational Longitudinal Study 1988 (NELS:88) field test, a seminar series at the Educational Testing Service (ETS), working papers prepared for several FR testing applications, and…

Descriptors: Comparative Analysis, Costs, Educational Assessment, Elementary Secondary Education

The Development of Authentic Assessments to Investigate Ninth Graders' Scientific Literacy: In the Case of Scientific Cognition Concerning the Concepts of Chemistry and Physics

Peer reviewed

Chang, Shu-Nu; Chiu, Mei-Hung – International Journal of Science and Mathematics Education, 2005

Scientific literacy and authenticity have gained considerable attention worldwide in the past few decades. The goal of the study was to develop various authentic assessments to investigate students' scientific literacy in response to the new curriculum reform of Taiwan in 1997. In the process, whether ninth graders were able to apply school…

Descriptors: Curriculum Development, Test Items, Educational Assessment, Scientific Principles

An Empirical Comparison of a Direct and an Indirect Method of Assessing Writing Proficiency.

Lombard, Juliana V. – 1988

The validity and reliability of two techniques of assessing writing proficiency were compared in a sample of 300 South African students (in Standard 8) for whom English was a second language. The objective, multiple choice method was compared with a subjective, essay method. Students completed a Standard English Second Language Item Bank Test and…

Descriptors: Comparative Analysis, Educational Assessment, English (Second Language), Essay Tests

Does It Matter if I Take My Writing Test on Computer? An Empirical Study of Mode Effects in NAEP

Peer reviewed

Horkay, Nancy; Bennett, Randy Elliott; Allen, Nancy; Kaplan, Bruce; Yan, Fred – Journal of Technology, Learning, and Assessment, 2006

This study investigated the comparability of scores for paper and computer versions of a writing test administered to eighth grade students. Two essay prompts were given on paper to a nationally representative sample as part of the 2002 main NAEP writing assessment. The same two essay prompts were subsequently administered on computer to a second…

Descriptors: Writing Evaluation, Writing Tests, Computer Assisted Testing, Program Effectiveness
