ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Descriptor

Test Format	70
Test Validity	70
Test Construction	36
Test Reliability	33
Test Items	23
Higher Education	17
Test Use	15
Foreign Countries	13
Language Tests	13
Multiple Choice Tests	12
Psychometrics	11
Testing Problems	11
Comparative Analysis	10
Comparative Testing	9
Factor Structure	8
Language Proficiency	8
Reading Tests	8
Scores	8
Standardized Tests	8
Adults	7
Computer Assisted Testing	7
Item Analysis	7
Reading Comprehension	7
High School Students	6
High Schools	6
More ▼

Source

Academic Medicine	1
College Board	1
Educational and Psychological…	1
Grantee Submission	1
International Association for…	1
Mathematics Education…	1
Psychological Assessment	1
Speech Education	1
TESOL Quarterly	1

Publication Type

Speeches/Meeting Papers	70
Reports - Research	38
Reports - Evaluative	21
Information Analyses	6
Journal Articles	5
Opinion Papers	4
Reports - Descriptive	3
Collected Works - General	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers	4
Practitioners	3
Teachers	2
Administrators	1

Location

Netherlands	3
Australia	2
Canada	2
United Kingdom (Great Britain)	2
China	1
Japan	1
Puerto Rico	1
United Kingdom (England)	1
United Kingdom (Northern…	1
United Kingdom (Wales)	1
West Germany	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Beck Depression Inventory	3
Embedded Figures Test	2
Keymath Diagnostic Arithmetic…	2
Armed Services Vocational…	1
Bar Examinations	1
Conflict Tactics Scale	1
Iowa Tests of Basic Skills	1
National Assessment of…	1
National Teacher Examinations	1
SRA Achievement Series	1
Self Description Questionnaire	1
Stanford Achievement Tests	1
Wechsler Intelligence Scale…	1
Wide Range Achievement Test	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 70 results Save | Export

Validity Evidence for Forced-Choice and Mixed-Format Knowledge Assessments

Peer reviewed
PDF on ERIC

Download full text

Cari F. Herrmann Abell – Grantee Submission, 2021

In the last twenty-five years, the discussion surrounding validity evidence has shifted both in language and scope, from the work of Messick and Kane to the updated Standards for Educational and Psychological Testing. However, these discussions haven't necessarily focused on best practices for different types of instruments or assessments, taking…

Descriptors: Test Format, Measurement Techniques, Student Evaluation, Rating Scales

Computer Based Mathematics Assessment: Is It the Panacea?

Download full text

Rogers, Angela – Mathematics Education Research Group of Australasia, 2021

Test developers are continually exploring the possibilities Computer Based Assessment (CBA) offers the Mathematics domain. This paper describes the trial of the Place Value Assessment Tool (PVAT) and its online equivalent, the PVAT-O. Both tests were administered using a counterbalanced research design to 253 Year 3-6 students across nine classes…

Descriptors: Mathematics Tests, Computer Assisted Testing, Number Concepts, Elementary School Students

Statistical Measures of Integrity in Online Testing: Empirical Study

Download full text

Wielicki, Tom – International Association for Development of the Information Society, 2016

This paper reports on longitudinal study regarding integrity of testing in an online format as used by e-learning platforms. Specifically, this study explains whether online testing, which implies an open book format is compromising integrity of assessment by encouraging cheating among students. Statistical experiment designed for this study…

Descriptors: Integrity, Online Courses, Statistical Surveys, Longitudinal Studies

Developing Form Assembly Specifications for Exams with Multiple Choice and Constructed Response Items: Balancing Reliability and Validity Concerns

Download full text

Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010

The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…

Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity

Informal Reading Inventories: What Are They Really Asking?

Download full text

Allen, Diane D.; Swearingen, Rebecca A. – 1991

A study analyzed the validity of inferential, cause/effect, and main idea questions which were asked in five selected commercial informal reading inventories (IRIs). The inventories were "Analytical Reading Inventory (3rd Edition),""Basic Reading Inventory (4th Edition)"; "Burns and Roe Informal Reading Inventory (3rd…

Descriptors: Educational Research, Elementary Education, Informal Reading Inventories, Reading Comprehension

An Item Format Continuum for Classroom Assessment.

Download full text

Johanson, George; Motlomelo, Samuel – 1998

Many textbooks in educational measurement and classroom assessment have chapters devoted to specific item formats. There may be attempts to relate one item format to another, but the chapters and item formats are largely seem as distinct entities with only loose and uncertain connections. This paper synthesizes these discussions. An item format…

Descriptors: Educational Assessment, Essay Tests, Measurement Techniques, Objective Tests

Improving the Psychometric, Criterion-Referenced, and Practical Qualities of Integrative Language Tests.

Peer reviewed

Cziko, Gary A. – TESOL Quarterly, 1982

Describes an attempt to construct an ESL dictation test that would: (1) be appropriate for a wide range of ability, (2) be easy and fast to score, (3) consist of set items that would form both a unidimensional and cumulative scale, and (4) yield scores that would be directly interpretable with respect to specified levels of English proficiency.…

Descriptors: Criterion Referenced Tests, English (Second Language), Higher Education, Scores

Psychometric Evaluation of the Beck Depression Inventory-II.

Peer reviewed

Dozois, David J. A.; Ahnberg, Jamie L.; Dobson, Keith S. – Psychological Assessment, 1998

Provides psychometric information on the second edition of the Beck Depression Inventory (BDI-II) (A. Beck, R. Steer, and G. Brown, 1996) for internal consistency, factorial validity, and gender differences. Results indicate that the BDI-II is a stronger instrument than its predecessor in terms of factor structure. (SLD)

Descriptors: Depression (Psychology), Factor Analysis, Factor Structure, Psychometrics

Domain-Referenced Testing of Reading Achievement.

Brittain, Mary M.; Brittain, Clay V. – 1981

A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…

Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction

Sequential Testing with a Performance-Based Examination Using Standardized Patients.

Peer reviewed

Colliver, Jerry A.; And Others – Academic Medicine, 1991

A study assessed the feasibility of sequential testing of medical students using standardized patients. Sequential testing passes students who score well on the first segment of the test thus eliminating additional student-standardized patient encounters. Subjects were six classes of Southern Illinois University students (n=404). Results strongly…

Descriptors: Efficiency, Higher Education, Medical Education, Patients

A Missing Data Approach to Estimating Distributions of Scores for Optional Test Sections.

Allen, Nancy L.; And Others – 1992

Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…

Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling

Oral Proficiency Testing in Special Circumstances: A Viable Alternative.

Reschke, Claus – 1983

An adaptation of the Foreign Service Institute's (FSI) oral interview test for oral language proficiency developed for use at the American Institute of Musical Studies' (AIMS) summer vocal institute in Austria to determine students' improvement in German language is discussed. The reasons for its selection over other major comparable tests are…

Descriptors: Achievement Gains, Interviews, Language Proficiency, Language Tests

Gender Differences in Adolescent Depression: Testing for Invariant Measurement and Structure for the BDI (French Version).

Download full text

Byrne, Barbara M.; And Others – 1992

A study of the Beck Depression Inventory (BDI) was conducted to: (1) test for the factorial validity of the French version of the BDI (BDI-FR) separately for 551 non-clinical Francophone adolescent males and 601 non-clinical Francophone adolescent females; (2) cross-validate findings across a second independent sample for each sex; and (3) test…

Descriptors: Adolescents, Depression (Psychology), Diagnostic Tests, Factor Structure

Issues of Candidate Perception in a Performance Test for Lawyers.

Download full text

Kunce, Charles S.; Arbet, Scott E. – 1994

The National Conference of Bar Examiners commissioned American College Testing, Inc., to help them in the development and evaluation of a performance test for use in bar admissions decisions. Because it was recognized that candidate perceptions would provide valuable information, a candidate-perception questionnaire was developed to be…

Descriptors: Attitudes, Demography, Languages, Lawyers

Developing and Improving the Quality of Written Tests.

Martin, Randy – 1988

Reasons for administering tests fall into two categories--decision-making and promoting learning. The two bases of tests are learning objectives and the level of learning at which training is developed. Test development involves a number of steps. The best way to tie objectives to test items is through the use of a table of specifications, which…

Descriptors: Elementary Secondary Education, Item Analysis, Item Banks, Postsecondary Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Allen, Nancy L.	2
Byrne, Barbara M.	2
Eignor, Daniel R.	2
Hambleton, Ronald K.	2
Melancon, Janet G.	2
Straus, Murray A.	2
Thompson, Bruce	2
Trevisan, Michael S.	2
Ahnberg, Jamie L.	1
Allen, Diane D.	1
Arbet, Scott E.	1
Auchter, Joan E.	1
Baron, Pierre	1
Benson, Jeri	1
Bolton, David L.	1
Braun, Carl	1
Brittain, Clay V.	1
Brittain, Mary M.	1
Brown, Annie	1
Brown, James Dean	1
Buser, Karen	1
Carcelli, Larry	1
Cari F. Herrmann Abell	1
Carlson, James E.	1
More ▼