Leighton, Jacqueline P. – Applied Measurement in Education, 2021
The objective of this paper is to comment on the think-aloud methods presented in the three papers included in this special issue. The commentary offered stems from the author's own psychological investigations of unobservable information processes and the conditions under which the most defensible claims can be advanced. The structure of this…
Descriptors: Protocol Analysis, Data Collection, Test Construction, Test Validity
MacKenzie D. Sidwell; Landon W. Bonner; Kayla Bates-Brantley; Shengtian Wu – Intervention in School and Clinic, 2024
Oral reading fluency probes are essential for reading assessment, intervention, and progress monitoring. Due to the limited options for choosing oral reading fluency probes, it is important to utilize all available resources such as generative artificial intelligence (AI) like ChatGPT to create oral reading fluency probes. The purpose of this…
Descriptors: Artificial Intelligence, Natural Language Processing, Technology Uses in Education, Oral Reading
Wagner, Elvis; Krylova, Anna – Language Assessment Quarterly, 2021
When the COVID-19 pandemic made it impossible to do in-person, on-campus testing, we were forced to create a new system to screen International Teaching Assistants (ITA) for Temple University. We used this opportunity to address many of the concerns and problems that we had identified with the previous test, and created a new test that could be…
Descriptors: Placement Tests, COVID-19, Pandemics, Computer Assisted Testing
Hirch, R. Roz – Language Assessment Quarterly, 2021
The following interview was conducted with Catherine Elder in spring of 2020, at the beginning of the pandemic. Cathie has had a varied career in language testing, including work at universities in Australia and New Zealand and at the Language Testing Resource Center in Melbourne. In this interview, Cathie shares some highlights of her somewhat…
Descriptors: Language Tests, Testing, Native Language Instruction, Second Language Instruction
Torres Irribarra, David – Measurement: Interdisciplinary Research and Perspectives, 2017
Maul's paper, "Rethinking Traditional Methods of Survey Validation," is a clever and pointed indictment of a set of specific but widespread practices in psychological measurement and the social sciences at large. Through it, Maul highlights central issues in the way to approach theory building and theory testing, bringing to mind the…
Descriptors: Surveys, Validity, Methods, Psychological Characteristics
Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016
Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…
Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction
Thissen, David – Measurement: Interdisciplinary Research and Perspectives, 2015
In "Adapting Educational Measurement to the Demands of Test-Based Accountability" Koretz takes the time-honored engineering approach to educational measurement, identifying specific problems with current practice and proposing minimal modifications of the system to alleviate those problems. In response to that article, David Thissen…
Descriptors: Educational Testing, Accountability, Testing Problems, Test Construction
Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016
The main points of Sijtsma and of Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions is at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…
Descriptors: Educational Assessment, Reliability, Validity, Test Construction
West, Stephen G.; Grimm, Kevin J. – Measurement: Interdisciplinary Research and Perspectives, 2014
These authors agree with Bainter and Bollen that causal effects represent a useful measurement structure in some applications. The structure of the science of the measurement problem should determine the model; the measurement model should not determine the science. They also applaud Bainter and Bollen's important reminder that the full…
Descriptors: Causal Models, Measurement, Test Theory, Statistical Analysis
Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2013
Measurement is a semantic frame, a constellation of relationships and concepts that correspond to recurring patterns in human activity, highlighting typical roles, processes, and viewpoints (e.g., the "commercial event") but not others. One uses semantic frames to reason about unique and complex situations--sometimes intuitively, sometimes…
Descriptors: Educational Assessment, Measurement, Feedback (Response), Evidence
Shepard, Lorrie A. – Measurement: Interdisciplinary Research and Perspectives, 2013
In his article, Haertel (this issue) asks a fundamental question about how use of a test is expected to cause improvements in the educational system and in learning. He also considers how test validity should be investigated and argues for a more expansive view of validity that does not stop with scoring or generalization (the more technical and…
Descriptors: Educational Testing, Test Validity, Test Results, Test Construction
Behizadeh, Nadia; Engelhard, George, Jr. – Measurement: Interdisciplinary Research and Perspectives, 2015
In his focus article, Koretz (this issue) argues that accountability has become the primary function of large-scale testing in the United States. He then points out that tests being used for accountability purposes are flawed and that the high-stakes nature of these tests creates a context that encourages score inflation. Koretz is concerned about…
Descriptors: Communities of Practice, High Stakes Tests, Testing, Test Validity
Skaggs, Gary – Measurement: Interdisciplinary Research and Perspectives, 2013
The construct map is a particularly good way to approach instrument development, and this author states that he was delighted to read Adam Wyse's thoughts about how to use construct maps for standard setting. For a number of popular standard-setting methods, Wyse shows how typical feedback to panelists fits within a construct map framework.…
Descriptors: Standard Setting (Scoring), Maps, Test Construction, Measurement
Johnson, Alyce O. – Journal of Psychoeducational Assessment, 2015
The "Parenting Stress Index, Fourth Edition" (PSI-4) is a 120-item measure used to explore parental stress levels considering a parent's relationship with one of his or her children between the ages of 1 month and 12 years. The main purpose of the test is to define these stress levels and from where they originate in order to identify…
Descriptors: Anxiety, Measures (Individuals), Parents, Child Rearing
Pollitt, Alastair – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article is valuable in many ways, especially for clarifying confusions and inconsistencies in the assessment business. Most importantly, he points out confusions that persist and where open discussion will help us understand what we say and what we mean to say. But I will focus here on the only faults I find in the article: three…
Descriptors: Validity, Evaluation, Definitions, Test Construction