Publication Date
In 2025 | 2 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 44 |
Since 2016 (last 10 years) | 88 |
Since 2006 (last 20 years) | 205 |
Descriptor
Test Construction | 470 |
Test Validity | 368 |
Test Reliability | 172 |
Test Items | 89 |
Higher Education | 72 |
Validity | 69 |
Elementary Secondary Education | 67 |
Language Tests | 67 |
Student Evaluation | 63 |
Evaluation Methods | 62 |
Psychometrics | 62 |
More ▼ |
Source
Author
Stansfield, Charles W. | 10 |
Gong, Brian | 4 |
Kenyon, Dorry Mann | 4 |
Baker, Eva L. | 3 |
Herman, Joan L. | 3 |
Ketterlin-Geller, Leanne R. | 3 |
Liu, Kimy | 3 |
O'Reilly, Tenaha | 3 |
Sabatini, John | 3 |
Straus, Murray A. | 3 |
Brown, James Dean | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 27 |
Teachers | 23 |
Researchers | 14 |
Administrators | 11 |
Policymakers | 5 |
Counselors | 2 |
Location
Australia | 9 |
Canada | 9 |
New York | 7 |
Nebraska | 5 |
Texas | 5 |
Japan | 4 |
Netherlands | 4 |
Tennessee | 4 |
United Kingdom | 4 |
California | 3 |
Florida | 3 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Every Student Succeeds Act… | 3 |
Comprehensive Education… | 2 |
Elementary and Secondary… | 2 |
Kentucky Education Reform Act… | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024
Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…
Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction
Denise Swanson; Gerald Tindal – Behavioral Research and Teaching, 2024
This technical report provides an authoritative bibliographic resource of all the studies conducted on "easyCBM"® and published on the main website for Behavioral Research and Teaching under Publications (https://brtprojects.org). The "easyCBM"© software is a direct descendent of "Curriculum-based Measurement" (CBM)…
Descriptors: Bibliographies, Computer Software, Test Construction, Test Reliability
National Institute for Excellence in Teaching, 2023
Aspiring teachers must develop an in-depth understanding of high-quality instructional practices. In order to prepare, instruct, and coach aspiring teachers, the National Institute for Excellence in Teaching (NIET) has developed a the NIET Aspiring Teacher Rubric (ATR) based on principles of excellence in instruction. This research brief…
Descriptors: Scoring Rubrics, Preservice Teachers, Test Construction, Test Validity
Philipp Sterner; Kim De Roover; David Goretzko – Structural Equation Modeling: A Multidisciplinary Journal, 2025
When comparing relations and means of latent variables, it is important to establish measurement invariance (MI). Most methods to assess MI are based on confirmatory factor analysis (CFA). Recently, new methods have been developed based on exploratory factor analysis (EFA); most notably, as extensions of multi-group EFA, researchers introduced…
Descriptors: Error of Measurement, Measurement Techniques, Factor Analysis, Structural Equation Models
Edith C. J. Roefs; Ida E. Oosterheert; Yvonne A. M. Leeman; William M. van der Veld; Paulien C. Meijer – International Journal of Research & Method in Education, 2024
This paper provides a detailed account of the development of an instrument to investigate the emerging concept of presence in teaching (PiT) on a larger scale, explaining how the transition from data and findings from qualitative studies to a survey instrument was accomplished. In order to ensure relevance for teachers, the instrument needed to do…
Descriptors: Test Construction, Interaction, Educational Research, Teacher Student Relationship
Meike Akveld; George Kinnear – International Journal of Mathematical Education in Science and Technology, 2024
Many universities use diagnostic tests to assess incoming students' preparedness for mathematics courses. Diagnostic test results can help students to identify topics where they need more practice and give lecturers a summary of strengths and weaknesses in their class. We demonstrate a process that can be used to make improvements to a mathematics…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Items, Item Analysis
Krystal Thomas; Todd A. Grindal; Daisy Wise Rutstein; Gullnar Syed; Sarah Nixon Gerard; Shari Golan; Sheryl Cababa; Amanda Di Dio; Behnosh Najafi; Kat Ward – SRI Education, a Division of SRI International, 2023
Instructional coaching, informed by observation tools that measure teachers' practices, has been effective in improving teaching quality in early learning programs. However, existing measurement tools limit teachers' abilities to implement this type of instructional coaching at scale. To address this challenge, a team at SRI Education, along with…
Descriptors: Preschool Education, Kindergarten, Coaching (Performance), Observation
Christian X. Navarro-Cota; Ana I. Molina; Miguel A. Redondo; Carmen Lacave – IEEE Transactions on Education, 2024
Contribution: This article describes the process used to create a questionnaire to evaluate the usability of mobile learning applications (CECAM). The questionnaire includes specific questions to assess user interface usability and pedagogical usability. Background: Nowadays, mobile applications are expanding rapidly and are commonly used in…
Descriptors: Usability, Questionnaires, Electronic Learning, Computer Oriented Programs
Yan Jin; Jason Fan – Language Assessment Quarterly, 2023
In language assessment, AI technology has been incorporated in task design, assessment delivery, automated scoring of performance-based tasks, score reporting, and provision of feedback. AI technology is also used for collecting and analyzing performance data in language assessment validation. Research has been conducted to investigate the…
Descriptors: Language Tests, Artificial Intelligence, Computer Assisted Testing, Test Format
Tyagi, Navneesh; Moses, D. Baby – International Journal of Leadership in Education, 2022
India is the second largest higher education network in the world, where the business environment is highly complex and competitive. So, what we need is an element of distinctiveness in our institutions of higher learning. A modification is required in the way these institutions are being run and supervised. A workable solution for managing and…
Descriptors: Foreign Countries, Higher Education, Administrator Effectiveness, Test Construction
Emily L. Coderre – College Teaching, 2024
Psychometrics is the field of designing tests and assessments to measure certain psychological concepts. It is chiefly concerned with two fundamental properties: reliability and validity. These properties are often influenced by confounding variables: other things that can influence performance but are not what you are trying to measure. Here, I…
Descriptors: Teaching Methods, Psychometrics, Test Construction, Test Reliability
Pejman Ghasemi Poor Sabet; Shen Zhan; Milad Baghalzadeh Shishehgarkhaneh – Education Research and Perspectives, 2024
Artificial intelligence (AI) is transforming multiple facets of the competitive business world and education. Despite this, the full potential of AI applications within education remains unclear because of the lack of a comprehensive framework on how to use AI in developing assessments across various academic disciplines. While incorporating AI…
Descriptors: Artificial Intelligence, Test Construction, Engineering Education, Technology Uses in Education
Leighton, Jacqueline P. – Applied Measurement in Education, 2021
The objective of this paper is to comment on the think-aloud methods presented in the three papers included in this special issue. The commentary offered stems from the author's own psychological investigations of unobservable information processes and the conditions under which the most defensible claims can be advanced. The structure of this…
Descriptors: Protocol Analysis, Data Collection, Test Construction, Test Validity
Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022
We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…
Descriptors: Science Tests, Test Validity, Test Items, Test Construction
Kotecki, Jerome E.; Greene, Maurita A.; Khubchandani, Jagdish; Kandiah, Jayanthi – American Journal of Health Education, 2021
Background: Diet quality assessment in community health settings is critical to reduce the incidence and improve management of diet-related chronic disease. Unfortunately, understandable and actionable brief dietary screening tools that empower individuals are nearly absent. Purpose: The purpose of this article is to describe two rigorous…
Descriptors: Dietetics, Screening Tests, Counseling, Health Education