Showing 1 to 15 of 42 results
Peer reviewed
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles
Peer reviewed
Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020
A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, limited research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…
Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items
Peer reviewed
Chen, Haiwen H.; von Davier, Matthias; Yamamoto, Kentaro; Kong, Nan – ETS Research Report Series, 2015
One major issue with large-scale assessments is that respondents may give no responses to many items, resulting in less accurate estimates of both assessed abilities and item parameters. This report studies how item types affect item-level nonresponse rates and how different methods of treating item-level nonresponses have an…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Peer reviewed
Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015
This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…
Descriptors: Models, Engineering Education, Test Items, Outcome Measures
Henning, Grant – English Teaching Forum, 2012
To some extent, good testing procedure, like good language use, can be achieved through avoidance of errors. Almost any language-instruction program requires the preparation and administration of tests, and it is only to the extent that certain common testing mistakes have been avoided that such tests can be said to be worthwhile selection,…
Descriptors: Testing, English (Second Language), Testing Problems, Student Evaluation
Sawchuk, Stephen – Education Week, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Test Items, Federal Legislation, Scoring, Accountability
National Council on Measurement in Education, 2012
Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…
Descriptors: State Programs, Integrity, Testing, Test Preparation
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as all-keying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Pike, Lewis W. – 1980
This study describes intergroup guessing differences in response to tests and to test-like tasks. It is a composite of seven component inquiries with three substudies in Phase 1 and four in Phase 2. These seven studies cover the Graduate Record Examination (GRE) item-type domain from a number of viewpoints relevant to implicit guessing behavior.…
Descriptors: Aptitude Tests, Black Students, College Entrance Examinations, Ethnic Groups
Shannon, Gregory A. – 1983
Rescoring of Center for Occupational and Professional Assessment objective-referenced tests is decided largely by content experts selected by client organizations. A few of the test items, statistically flagged for review, are not rescored. Some of this incongruence could be due to the use of the biserial correlation (r-biserial) as an…
Descriptors: Adults, Criterion Referenced Tests, Item Analysis, Occupational Tests
Lord, Frederic M. – 1982
Explored are two theoretical approaches that attempt to cope with omitted responses, that is, when an examinee omits (fails to respond to) an item and therefore the item response formula cannot be used. Preliminary considerations are discussed, and it is shown that a conveniently simple application of equivalent items leads to internal…
Descriptors: Guessing (Tests), Latent Trait Theory, Mathematical Models, Maximum Likelihood Statistics
Plake, Barbara S.; And Others – 1983
Differential test performance by undergraduate males and females enrolled in a developmental educational psychology course (n=167) was reported on a quantitative examination as a function of item arrangement. Males were expected to perform better than females on tests whose items were arranged easy to hard. Plake and Ansorge (1982) speculated this may…
Descriptors: Difficulty Level, Feedback, Higher Education, Scoring
Peer reviewed
Rusch, Reuben; Steiner, Judith – Journal of Experimental Education, 1979
The Selected Marker Tests were examined for scoring problems and internal consistency and were administered orally to sixth and seventh graders. Scoring problems were discovered and changes were suggested. The problem was found to be item reliability rather than interrater reliability. (Author/MH)
Descriptors: Cognitive Tests, Elementary Education, Item Analysis, Problem Solving
Haenn, Joseph F. – 1981
Procedures for conducting functional level testing have been available for use by practitioners for some time. However, the Title I Evaluation and Reporting System (TIERS), developed in response to the educational amendments of 1974 to the Elementary and Secondary Education Act (ESEA), has provided the impetus for widespread adoption of this…
Descriptors: Achievement Tests, Difficulty Level, Scores, Scoring
Drasgow, Fritz, Ed.; Olson-Buchanan, Julie B., Ed. – 1999
Chapters in this book present the challenges and dilemmas faced by researchers as they created new computerized assessments, focusing on issues addressed in developing, scoring, and administering the assessments. Chapters are: (1) "Beyond Bells and Whistles; An Introduction to Computerized Assessment" (Julie B. Olson-Buchanan and Fritz Drasgow);…
Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Scoring