Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 156 |
Descriptor
Comparative Analysis | 286 |
Reliability | 131 |
Test Reliability | 108 |
Test Validity | 65 |
Foreign Countries | 60 |
Interrater Reliability | 53 |
Validity | 52 |
Evaluation Methods | 49 |
Scores | 39 |
Psychometrics | 38 |
Measures (Individuals) | 37 |
More ▼ |
Source
Author
Lunz, Mary E. | 4 |
Coniam, David | 3 |
Feldt, Leonard S. | 3 |
Alsawalmeh, Yousef M. | 2 |
Brennan, Robert L. | 2 |
Fuchs, Lynn S. | 2 |
Lee, Guemin | 2 |
Linn, Robert L. | 2 |
Martin, Michael O., Ed. | 2 |
O'Neill, Thomas R. | 2 |
Ritvo, Riva Ariella | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 8 |
Researchers | 3 |
Teachers | 3 |
Administrators | 2 |
Counselors | 1 |
Media Staff | 1 |
Policymakers | 1 |
Support Staff | 1 |
Location
United States | 9 |
Canada | 7 |
Taiwan | 5 |
Australia | 4 |
Hong Kong | 3 |
Portugal | 3 |
United Kingdom | 3 |
Belgium | 2 |
China | 2 |
Finland | 2 |
Florida | 2 |
More ▼ |
Laws, Policies, & Programs
Improving Americas Schools… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Junjie, Ma; Yingxin, Ma – Online Submission, 2022
This paper aims to explore the philosophical theoretical foundations of two basic research paradigms, namely positivism and interpretivism. In the discussion process, literature in the relevant fields including academic papers and books is reviewed and used as support for the analysis. Firstly, the paper explores the differences between the…
Descriptors: Ideology, Bias, Credibility, Research Methodology
Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024
RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…
Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics
Rohlfing, Ingo – Field Methods, 2020
Empirical researchers using qualitative comparative analysis (QCA) can work with crisp, multivalue, and fuzzy sets. The relative advantages of crisp and multivalue sets have been discussed in the QCA literature. There has been little reflection on the more frequent decision between crisp and fuzzy sets for which there often is no theoretical…
Descriptors: Qualitative Research, Comparative Analysis, Reliability, Classification
DeLuca, Stefanie – Sociological Methods & Research, 2023
Increasingly, the broader public, media and policymakers are looking to qualitative research to provide answers to our most pressing social questions. While an exciting and perhaps overdue moment for qualitative researchers, it is also a time when the method is coming under increasing scrutiny for a lack of reliability and transparency. The…
Descriptors: Qualitative Research, Reliability, Standards, Participant Observation
Lisa Frances; Frances Quinn; Sue Elliott; Jo Bird – Australian Educational Researcher, 2024
In this article, we explore inconsistencies in the implementation of outdoor learning across Australian early years' education. The benefits of outdoor learning justify regular employment of this pedagogical approach in both early childhood education and primary school settings. Early childhood education services provide daily outdoor learning…
Descriptors: Foreign Countries, Outdoor Education, Program Implementation, Elementary Education
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Artamonova, Tatiana – Foreign Language Annals, 2020
This article describes the development of a new questionnaire for assessing L2 learners' language attitudes. Drawing on theoretical work in the fields of social psychology and applied linguistics, the author reviewed the concept of (language) attitudes and contrasted them with the concept of (language) motivation. This thorough literature review,…
Descriptors: Student Attitudes, Language Attitudes, Second Language Learning, Second Language Instruction
Braumoeller, Bear F. – Sociological Methods & Research, 2017
Fuzzy-set qualitative comparative analysis (fsQCA) has become one of the most prominent methods in the social sciences for capturing causal complexity, especially for scholars with small- and medium-"N" data sets. This research note explores two key assumptions in fsQCA's methodology for testing for necessary and sufficient…
Descriptors: Qualitative Research, Comparative Analysis, Social Science Research, Research Methodology
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
Ho, Andrew D.; Kane, Thomas J. – Bill & Melinda Gates Foundation, 2013
For many teachers, the classroom observation has been the only opportunity to receive direct feedback from another school professional. As such, it is an indispensable part of every teacher evaluation system. Yet it also requires a major time commitment from teachers, principals, and peer observers. To justify the investment of time and resources,…
Descriptors: Observation, Teacher Evaluation, Accuracy, Reliability
Tschichold, Cornelia – Research-publishing.net, 2019
Calls for replication studies are becoming more frequent, and Computer Assisted Language Learning (CALL) has now reached sufficient maturity to offer numerous studies that lend themselves to replication. Realistic and successful replications rely on transparency in terms of data, results, and methodology. Two published studies in the area of…
Descriptors: Computer Assisted Instruction, Second Language Learning, Second Language Instruction, Computer Software
Wilkin, John P. – College & Research Libraries, 2017
The 1961 Copyright Office study on renewals, authored by Barbara Ringer, has cast an outsized influence on discussions of the U.S. 1923-1963 public domain. As more concrete data emerge from initiatives such as the large-scale determination process in the Copyright Review Management System (CRMS) project, questions are raised about the reliability…
Descriptors: Comparative Analysis, Copyrights, Misconceptions, Test Reliability
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's a, Feldt-Raju, stratified a, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
Mahfoud, Ziyad; Ghandour, Lilian; Ghandour, Blanche; Mokdad, Ali H.; Sibai, Abla M. – Field Methods, 2015
Findings on the reliability and cost-effectiveness of the use of cellular phones vis-à-vis face-to-face interviews in investigating health behaviors and conditions are presented for a national epidemiological sample from Lebanon. Using self-reported responses on identical questions, percentage agreement, ? statistics, and McNemar's test were used…
Descriptors: Telephone Surveys, Interviews, Surveys, Responses