Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 8 |
Descriptor
Educational Testing | 15 |
Evaluation Methods | 4 |
Criteria | 3 |
Foreign Countries | 3 |
Guidelines | 3 |
Student Attitudes | 3 |
Test Items | 3 |
Testing Problems | 3 |
Computer Assisted Testing | 2 |
Cutting Scores | 2 |
Gender Differences | 2 |
More ▼ |
Source
International Journal of… | 15 |
Author
Evers, Arne | 3 |
Zilberberg, Anna | 2 |
Anderson, Robin D. | 1 |
Bartram, Dave | 1 |
Buckendahl, Chad W. | 1 |
Davis-Becker, Susan L. | 1 |
DeMars, Christine E. | 1 |
Eklof, Hanna | 1 |
Finney, Sara J. | 1 |
Gerrow, Jack | 1 |
Glas, Cees A. W. | 1 |
More ▼ |
Publication Type
Journal Articles | 15 |
Reports - Descriptive | 5 |
Reports - Research | 5 |
Reports - Evaluative | 3 |
Tests/Questionnaires | 2 |
Book/Product Reviews | 1 |
Guides - Non-Classroom | 1 |
Information Analyses | 1 |
Education Level
Grade 8 | 2 |
Higher Education | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Location
Netherlands | 2 |
Indiana | 1 |
Maryland | 1 |
New Jersey | 1 |
Sweden | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 1 |
What Works Clearinghouse Rating
Zilberberg, Anna; Finney, Sara J.; Marsh, Kimberly R.; Anderson, Robin D. – International Journal of Testing, 2014
Given worldwide prevalence of low-stakes testing for monitoring educational quality and students' progress through school (e.g., Trends in International Mathematics and Science Study, Program for International Student Assessment), interpretability of resulting test scores is of global concern. The nonconsequential nature of low-stakes tests…
Descriptors: Student Attitudes, Student Motivation, Test Validity, Accountability
Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015
The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…
Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping
Ling, Guangming – International Journal of Testing, 2016
To investigate possible iPad related mode effect, we tested 403 8th graders in Indiana, Maryland, and New Jersey under three mode conditions through random assignment: a desktop computer, an iPad alone, and an iPad with an external keyboard. All students had used an iPad or computer for six months or longer. The 2-hour test included reading, math,…
Descriptors: Educational Testing, Computer Assisted Testing, Handheld Devices, Computers
Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack – International Journal of Testing, 2011
Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…
Descriptors: Standard Setting (Scoring), Cutting Scores, Educational Testing, Licensing Examinations (Professions)
Puhan, Gautam – International Journal of Testing, 2011
This study examined the effect of including or excluding repeaters on the equating process and results. New forms of two tests were equated to their respective old forms using either all examinees or only the first timer examinees in the new form sample. Results showed that for both tests used in this study, including or excluding repeaters in the…
Descriptors: Equated Scores, Educational Testing, Student Evaluation, Sample Size
Evers, Arne; Sijtsma, Klaas; Lucassen, Wouter; Meijer, Rob R. – International Journal of Testing, 2010
This article describes the 2009 revision of the Dutch Rating System for Test Quality and presents the results of test ratings from almost 30 years. The rating system evaluates the quality of a test on seven criteria: theoretical basis, quality of the testing materials, comprehensiveness of the manual, norms, reliability, construct validity, and…
Descriptors: Rating Scales, Documentation, Educational Quality, Educational Testing

Bartram, Dave – International Journal of Testing, 2001
Describes the development of the International Guidelines for Test Use through the International Test Commission (ITC) project. These guidelines are designed to provide an international view of areas in which there is consensus on what constitutes good practice in testing. Describes why such guidelines are needed, and discusses their use. (SLD)
Descriptors: Educational Testing, Guidelines, Test Use
Eklof, Hanna – International Journal of Testing, 2007
This study explores the reported level of test-taking motivation and the relation between test-taking motivation and mathematics achievement in a sample of Swedish eighth-grade students (n = 343) participating in Trends in International Mathematics and Science Study (TIMSS) 2003. A majority of students reported that they were motivated to do their…
Descriptors: Males, Mathematics Achievement, Student Motivation, Student Attitudes

Oakland, Thomas; Poortinga, Ype H.; Schlegel, Justin; Hambleton, Ronald K. – International Journal of Testing, 2001
Traces the history of the International Test Commission (ITC), reviewing the context in which it was formed, its goals, and major milestones in its development. Suggests ways the ITC may continue to impact test development positively, and introduces this inaugural journal issue. (SLD)
Descriptors: Educational History, Educational Testing, International Education, Test Construction

Evers, Arne – International Journal of Testing, 2001
Describes the Dutch rating system for test quality, which evaluates a test for seven criteria, and analyses the results of test ratings from the past 18 years. Results show a steady increase in test quality in the Netherlands that can be attributed to use of better tests and declining use of tests of less quality after evaluation. (SLD)
Descriptors: Criteria, Educational Testing, Evaluation Methods, Foreign Countries

Evers, Arne – International Journal of Testing, 2001
Describes the 1997 revision of the Dutch Rating System for Test Quality used by a committee of the Dutch Association of Psychologists. The rating system evaluates test quality on seven criteria using a checklist for each criterion. Comment sections provide additional information, and weighting rules establish the final grades. (SLD)
Descriptors: Criteria, Educational Testing, Evaluation Methods, Foreign Countries
Zimmerman, Donald W.; Williams, Richard H.; Zumbo, Bruno D.; Ross, Donald – International Journal of Testing, 2005
This article focuses on Louis Guttman's contributions to the classical theory of educational and psychological tests, one of the lesser known of his many contributions to quantitative methods in the social sciences. Guttman's work in this field provided a rigorous mathematical basis for ideas that, for many decades after Spearman's initial work,…
Descriptors: Evaluation Methods, Test Theory, Social Sciences, Psychological Testing

International Journal of Testing, 2001
Contains guidelines that provide an international view of areas of consensus about what constitutes "good practice" in test use. Guidelines address key competencies, such as knowledge and skills, and issues of professional and ethical standards in testing, the rights of test takers, test administration and scoring, and other issues. (SLD)
Descriptors: Competence, Educational Practices, Educational Testing, Guidelines

Glas, Cees A. W. – International Journal of Testing, 2002
"Test Scoring" provides insight into psychometric procedures as used by a professional testing company or in large-scale projects. The book contains an overview of standard test theory, a discussion of factor analytic theory, and an exploration of special applications and problems. (SLD)
Descriptors: Educational Testing, Factor Analysis, Measurement Techniques, Psychometrics
International Journal of Testing, 2006
Developed by the International Test Commission, the International Guidelines on Computer-Based and Internet-Delivered Testing are a set of guidelines specifically developed to highlight good practice issues in relation to computer/Internet tests and testing. These guidelines have been developed from an international perspective and are directed at…
Descriptors: Guidelines, Computer Assisted Testing, Internet, Test Construction