Showing 1 to 15 of 22 results
Peer reviewed
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Peer reviewed
Jerrim, John – Assessment in Education: Principles, Policy & Practice, 2016
The Programme for International Student Assessment (PISA) is an important cross-national study of 15-year-olds' academic achievement. Although it has traditionally been conducted using paper-and-pencil tests, the vast majority of countries will use computer-based assessment from 2015. In this paper, we consider how cross-country comparisons of children's…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Peer reviewed
Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014
There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…
Descriptors: Student Surveys, National Surveys, Art Education, Design
Peer reviewed
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
Peer reviewed
National Center for Education Statistics, 2007
The purpose of this document is to provide background information that will be useful in interpreting the 2007 results from the Trends in International Mathematics and Science Study (TIMSS) by comparing its design, features, framework, and items with those of the U.S. National Assessment of Educational Progress and another international assessment…
Descriptors: National Competency Tests, Comparative Analysis, Achievement Tests, Test Items
Peer reviewed
Hoyt, Kenneth B. – Journal of Counseling & Development, 1986
The microcomputer version of the Ohio Vocational Interest Survey (OVIS II) differs from the machine-scored version in its ability to incorporate data from the OVIS II: Career Planner in its printed report. It differs from the hand-scored version in its ability to include data from the OVIS II: Work Characteristic Analysis in its printed report.…
Descriptors: Comparative Analysis, Computer Assisted Testing, Microcomputers, Test Format
Peer reviewed
Jaeger, Richard M. – Journal of Educational Measurement, 1981
Five indices are discussed that should logically discriminate between situations in which: (1) the linear equating method (LEM) adequately adjusts for differences between the score distributions of two approximately parallel test forms; or (2) a method more complex than the linear equating method is needed. (RL)
Descriptors: College Entrance Examinations, Comparative Analysis, Difficulty Level, Equated Scores
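The linear equating method referred to in the Jaeger abstract places scores from one test form onto the scale of an approximately parallel form by matching the means and standard deviations of the two score distributions. A minimal sketch of that basic transformation, assuming illustrative score data (the five diagnostic indices discussed in the article are not reproduced here):

```python
import numpy as np

def linear_equate(scores_x, scores_y):
    """Map a Form X raw score onto the Form Y scale by matching means
    and standard deviations of the two score distributions:
        y = mu_Y + (sigma_Y / sigma_X) * (x - mu_X)
    Returns a function converting Form X scores to Form Y equivalents."""
    mu_x, sigma_x = np.mean(scores_x), np.std(scores_x, ddof=1)
    mu_y, sigma_y = np.mean(scores_y), np.std(scores_y, ddof=1)
    return lambda x: mu_y + (sigma_y / sigma_x) * (np.asarray(x) - mu_x)

# Illustrative (made-up) raw scores from two randomly equivalent groups.
form_x = np.array([18, 22, 25, 27, 30, 33, 35, 38, 41, 44])
form_y = np.array([20, 24, 26, 29, 31, 34, 37, 40, 42, 46])

to_form_y = linear_equate(form_x, form_y)
print(to_form_y(30))  # Form Y equivalent of a Form X raw score of 30
```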
Davison, Mark L. – 1981
Academic psychology has long been composed of two disciplines, one experimental and one correlational. These two disciplines each developed their own method of studying structure in data: multidimensional scaling (MDS) and factor analysis. Both methods use similar kinds of input data, proximity measures on object pairs. Both represent the object…
Descriptors: Ability, Comparative Analysis, Correlation, Factor Analysis
Oltman, Philip K.; Stricker, Lawrence J. – 1988
A study examined the relationship of native language and level of English proficiency to the structure of the Test of English as a Foreign Language (TOEFL). Using all of the information provided by various responses to the test's items (the four alternatives, omitted, and not reached), the items' interrelations were analyzed by three-way…
Descriptors: Comparative Analysis, Construct Validity, English (Second Language), Language Proficiency
Peer reviewed
Tsai, Fu-Ju; Suen, Hoi K. – Educational and Psychological Measurement, 1993
Six methods of scoring multiple true-false items were compared in terms of reliabilities, difficulties, and discrimination. Results suggest that, for norm-referenced score interpretations, there is insufficient evidence to support any one of the methods as superior. For criterion-referenced score interpretations, effects of scoring method must be…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Difficulty Level, Guessing (Tests)
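Two commonly discussed rules for scoring multiple true-false clusters are statement-level scoring (one point per correctly judged statement) and all-or-nothing cluster scoring (credit only when every statement in the cluster is judged correctly). A minimal sketch of those two rules, with made-up responses; these are illustrative assumptions, not the six methods compared in the Tsai and Suen study:

```python
def statement_score(responses, key):
    """One point for each true/false statement judged correctly."""
    return sum(r == k for r, k in zip(responses, key))

def cluster_score(responses, key):
    """All-or-nothing: full credit only if every statement is correct."""
    return 1 if all(r == k for r, k in zip(responses, key)) else 0

# Illustrative item: one stem with four true/false statements.
key = [True, False, True, True]
examinee = [True, False, False, True]

print(statement_score(examinee, key))  # 3 of 4 statements judged correctly
print(cluster_score(examinee, key))    # 0, because the cluster is not fully correct
```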
Peer reviewed
Ndalichako, Joyce L.; Rogers, W. Todd – Educational and Psychological Measurement, 1997
Ability estimates obtained from applying finite state score theory, item response models, and classical test theory to score multiple-choice items were compared using responses of 1,230 examinees. Scoring models provided essentially the same ranking of examinees, but ease of use and interpretation support the use of the classical test model. (SLD)
Descriptors: Ability, Comparative Analysis, Estimation (Mathematics), High School Students
Cook, Nancy R.; Smith, Robert A. – 1999
New Hampshire has adopted a standards-based statewide assessment, the New Hampshire Educational Improvement and Assessment Program (NHEIAP), which is designed to measure students' learning against proficiency standards at grades 3, 6, and 10. Because of the difficulty teachers had in interpreting the NHEIAP results, a custom-designed software…
Descriptors: Academic Standards, Comparative Analysis, Computer Software, Data Analysis
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…
Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit
Peer reviewed
Beller, Michael – Applied Psychological Measurement, 1990
Geometric approaches to representing interrelations among tests and items are compared with an additive tree model (ATM), using 2,644 examinees and 2 other data sets. The ATM's close fit to the data and its coherence of presentation indicate that it is the best means of representing tests and items. (TJH)
Descriptors: College Students, Comparative Analysis, Factor Analysis, Foreign Countries
Shrock, Sharon; And Others – Performance and Instruction, 1986
Presents major stages in the design and development of criterion-referenced tests (CRT), with emphasis on differences between CRT construction and norm-referenced test construction. Discussion covers test interpretation; test theory; preparation for test construction (hierarchical analysis, item type selection, and choosing number of items); test…
Descriptors: Adoption (Ideas), Comparative Analysis, Criterion Referenced Tests, Industrial Training