Log in

Relevant bibliographies by topics / Item response theory – Statistical methods / Journal articles

To see the other types of publications on this topic, follow the link: Item response theory – Statistical methods.

Journal articles on the topic 'Item response theory – Statistical methods'

Author: Grafiati

Published: 4 June 2021

Last updated: 2 February 2022

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 journal articles for your research on the topic 'Item response theory – Statistical methods.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.

1

Ibrahim, Abdul Wahab. "The Applicability of Item Response Theory Based Statistics to Detect Differential Item Functioning in Polytomous Tests." Randwick International of Education and Linguistics Science Journal 1, no. 1 (June 23, 2020): 1–13. http://dx.doi.org/10.47175/rielsj.v1i1.23.

Full text

Abstract:

The study used statistical procedures based on Item Response Theory to detect Differential Item Functioning (DIF) in polytomous tests. These were with a view to improving the quality of test items construction. The sample consisted of an intact class of 513 Part 3 undergraduate students who registered for the course EDU 304: Tests and Measurement at Sule Lamido University during 2017/2018 Second Semester. A self-developed polytomous research instrument was used to collect data. Data collected were analysed using Generalized Mantel Haenszel, Simultaneous Item Bias Test, and Logistic Discriminant Function Analysis. The results showed that there was no significant relationship between the proportions of test items that function differentially in the polytomous test when the different statistical methods are used. Further, the three parametric and non-parametric methods complement each other in their ability to detect DIF in the polytomous test format as all of them have capacity to detect DIF but perform differently. The study concluded that there was a high degree of correspondence between the three procedures in their ability to detect DIF in polytomous tests. It was recommended that test experts and developers should consider using procedure based on Item Response Theory in DIF detection.

APA, Harvard, Vancouver, ISO, and other styles

2

Veilleux, Jennifer C., and Kate M. Chapman. "Development of a Research Methods and Statistics Concept Inventory." Teaching of Psychology 44, no. 3 (May 30, 2017): 203–11. http://dx.doi.org/10.1177/0098628317711287.

Full text

Abstract:

Research methods and statistics are core courses in the undergraduate psychology major. To assess learning outcomes, it would be useful to have a measure that assesses research methods and statistical literacy beyond course grades. In two studies, we developed and provided initial validation results for a research methods and statistical knowledge concept inventory for eventual use in further scholarship of teaching and learning. In Study 1, we created vignettes and administered open-ended questions to psychology subject pool students. In Study 2, we refined the vignettes and created multiple-choice items using participant responses from Study 1. After administering the measure to psychology subject pool students and a community-based sample of Amazon Mechanical Turk workers, we used item response theory to select 20 items to compose the final Psychological Research Inventory of Concepts.

APA, Harvard, Vancouver, ISO, and other styles

3

Fergadiotis, Gerasimos, Stacey Kellough, and William D. Hula. "Item Response Theory Modeling of the Philadelphia Naming Test." Journal of Speech, Language, and Hearing Research 58, no. 3 (June 2015): 865–77. http://dx.doi.org/10.1044/2015_jslhr-l-14-0249.

Full text

Abstract:

Purpose In this study, we investigated the fit of the Philadelphia Naming Test (PNT; Roach, Schwartz, Martin, Grewal, & Brecher, 1996) to an item-response-theory measurement model, estimated the precision of the resulting scores and item parameters, and provided a theoretical rationale for the interpretation of PNT overall scores by relating explanatory variables to item difficulty. This article describes the statistical model underlying the computer adaptive PNT presented in a companion article (Hula, Kellough, & Fergadiotis, 2015). Method Using archival data, we evaluated the fit of the PNT to 1- and 2-parameter logistic models and examined the precision of the resulting parameter estimates. We regressed the item difficulty estimates on three predictor variables: word length, age of acquisition, and contextual diversity. Results The 2-parameter logistic model demonstrated marginally better fit, but the fit of the 1-parameter logistic model was adequate. Precision was excellent for both person ability and item difficulty estimates. Word length, age of acquisition, and contextual diversity all independently contributed to variance in item difficulty. Conclusions Item-response-theory methods can be productively used to analyze and quantify anomia severity in aphasia. Regression of item difficulty on lexical variables supported the validity of the PNT and interpretation of anomia severity scores in the context of current word-finding models.

APA, Harvard, Vancouver, ISO, and other styles

4

Blanchin, Myriam, Alice Guilleux, Jean-Benoit Hardouin, and Véronique Sébille. "Comparison of structural equation modelling, item response theory and Rasch measurement theory-based methods for response shift detection at item level: A simulation study." Statistical Methods in Medical Research 29, no. 4 (October 30, 2019): 1015–29. http://dx.doi.org/10.1177/0962280219884574.

Full text

Abstract:

When assessing change in patient-reported outcomes, the meaning in patients’ self-evaluations of the target construct is likely to change over time. Therefore, methods evaluating longitudinal measurement non-invariance or response shift at item-level were proposed, based on structural equation modelling or on item response theory. Methods coming from Rasch measurement theory could also be valuable. The lack of evaluation of these approaches prevents determining the best strategy to adopt. A simulation study was performed to compare and evaluate the performance of structural equation modelling, item response theory and Rasch measurement theory approaches for item-level response shift detection. Performances of these three methods in different situations were evaluated with the rate of false detection of response shift (when response shift was not simulated) and the rate of correct response shift detection (when response shift was simulated). The Rasch measurement theory-based method performs better than the structural equation modelling and item response theory-based methods when recalibration was simulated. Consequently, the Rasch measurement theory-based approach should be preferred for studies investigating only recalibration response shift at item-level. For structural equation modelling and item response theory, the low rates of reprioritization detection raise issues on the potential different meaning and interpretation of reprioritization at item-level.

APA, Harvard, Vancouver, ISO, and other styles

5

Dardick, William R., and Brandi A. Weiss. "An Investigation of Chi-Square and Entropy Based Methods of Item-Fit Using Item Level Contamination in Item Response Theory." Journal of Modern Applied Statistical Methods 18, no. 2 (October 2, 2020): 2–43. http://dx.doi.org/10.22237/jmasm/1604190480.

Full text

Abstract:

New variants of entropy as measures of item-fit in item response theory are investigated. Monte Carlo simulation(s) examine aberrant conditions of item-level misfit to evaluate relative (compare EMRj, X2, G2, S-X2, and PV-Q1) and absolute (Type I error and empirical power) performance. EMRj has utility in discovering misfit.

APA, Harvard, Vancouver, ISO, and other styles

6

Abdelhamid, Gomaa Said Mohamed, Marwa Gomaa Abdelghani Bassiouni, and Juana Gómez-Benito. "Assessing Cognitive Abilities Using the WAIS-IV: An Item Response Theory Approach." International Journal of Environmental Research and Public Health 18, no. 13 (June 25, 2021): 6835. http://dx.doi.org/10.3390/ijerph18136835.

Full text

Abstract:

Background: The Wechsler Adult Intelligence Scale-Fourth Edition (WAIS-IV) has been adapted to 28 different cultures and there has been considerable interest in examining its structure through exploratory and confirmatory factor analysis. This study investigates item and scale properties of the Egyptian WAIS-IV using item response theory (IRT) models. Methods: The sample consisted of 250 adults from Egypt. The item-level and subtest statistical properties of the Egyptian WAIS-IV were established using a combination of four dichotomous IRT models and four polytomous IRT models. In addition, factor analysis was performed to investigate the dimensionality of each subtest. Results: Factor analysis indicated the unidimensionality of each subtest. Among IRT models, the two-parameter logistic model provided a good fit for dichotomous subtests, while the graded response model fitted the polytomous data. Most items of the Egyptian WAIS-IV showed high discrimination, and the scale was adequately informative across the levels of latent traits (i.e., cognitive variables). However, each subtest included at least some items with limited ability to distinguish between individuals with differing levels of the cognitive variable being measured. Furthermore, most subtests have items that do not follow the difficulty rank they are ascribed in the WAIS-IV manual. Conclusions: Overall, the results suggest that the Egyptian WAIS-IV offers a highly valid assessment of intellectual abilities, despite the need for some improvements.

APA, Harvard, Vancouver, ISO, and other styles

7

Lisova, Tetiana V. "МЕТОДИ ТА ПРОГРАМНІ ЗАСОБИ ДОСЛІДЖЕННЯ ВАЛІДНОСТІ ТЕСТОВИХ РЕЗУЛЬТАТІВ ДЛЯ ГРУП ТЕСТОВАНИХ З ПЕВНИМИ ІНДИВІДУАЛЬНИМИ ОСОБЛИВОСТЯМИ." Information Technologies and Learning Tools 50, no. 6 (January 1, 2016): 165. http://dx.doi.org/10.33407/itlt.v50i6.1283.

Full text

Abstract:

The necessary condition for the presence of biased assessment by some test is differential item functioning in different groups of test takers. The ideas of some statistical methods for detecting Differential Item Functioning are described in the given article. They were developed in the framework of the main approaches to modeling test results: using contingency tables, regression models, multidimensional models and models of Item Response Theory. The Mantel-Haenszel procedure, logistic regression method, SIBTEST and Item Response Theory Likelihood Ratio Test are considered. The characteristics of each method and conditions of their application are specified. Overview of existing free software tools implementing these methods is carried out. Comparisons of these methods are conducted on the example of real data. Also notes that it is appropriate to use several methods simultaneously to reduce the risk of false conclusions.

APA, Harvard, Vancouver, ISO, and other styles

8

Et.al, Madeline D. Cabauatan. "Statistical Evaluation of Item Nonresponse Methods Using the World Bank’s 2015 Philippines Enterprise Survey." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12, no. 3 (April 11, 2021): 4077–88. http://dx.doi.org/10.17762/turcomat.v12i3.1698.

Full text

Abstract:

The main objective of the study was to evaluate item nonresponse procedures through a simulation study of different nonresponse levels or missing rates. A simulation study was used to explore how each of the response rates performs under a variety of circumstances. It also investigated the performance of procedures suggested for item nonresponse under various conditions and variable trends. The imputation methods considered were the cell mean imputation, random hotdeck, nearest neighbor, and simple regression. These variables are some of the major indicators for measuring productive labor and decent work in the country. For the purpose of this study, the researcher is interested in evaluating methods for imputing missing data for the number of workers and total cost of labor per establishment from the World Bank’s 2015 Enterprise Survey for the Philippines. The performances of the imputation techniques for item nonresponse were evaluated in terms of bias and coefficient of variation for accuracy and precision. Based on the results, the cell-mean imputation was seen to be most appropriate for imputing missing values for the total number of workers and total cost of labor per establishment. Since the study was limited to the variables cited, it is recommended to explore other labor indicators. Moreover, exploring choice of other clustering groups is highly recommended as clustering groups have great effect in the resulting estimates of imputation estimation. It is also recommended to explore other imputation techniques like multiple regression and other parametric models for nonresponse such as the Bayes estimation method. For regression based imputation, since the study is limited only in using the cluster groupings estimation, it is highly recommended to use other possible variables that might be related to the variable of interest to verify the results of this study.

APA, Harvard, Vancouver, ISO, and other styles

9

Liu, Yang, Guanyu Hu, Lei Cao, Xiaojing Wang, and Ming-Hui Chen. "A comparison of Monte Carlo methods for computing marginal likelihoods of item response theory models." Journal of the Korean Statistical Society 48, no. 4 (December 2019): 503–12. http://dx.doi.org/10.1016/j.jkss.2019.04.001.

Full text

APA, Harvard, Vancouver, ISO, and other styles

10

Bürkner, Paul-Christian, Niklas Schulte, and Heinz Holling. "On the Statistical and Practical Limitations of Thurstonian IRT Models." Educational and Psychological Measurement 79, no. 5 (February 22, 2019): 827–54. http://dx.doi.org/10.1177/0013164419832063.

Full text

Abstract:

Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been proposed. For convenient model specification, we introduce the thurstonianIRT R package, which uses Mplus, lavaan, and Stan for model estimation. Based on practical considerations, we establish that items within one block need to be equally keyed to achieve similar social desirability, which is essential for creating forced-choice questionnaires that have the potential to resist faking intentions. According to extensive simulations, measuring up to five traits using blocks of only equally keyed items does not yield sufficiently accurate trait scores and inter-trait correlation estimates, neither for frequentist nor for Bayesian estimation methods. As a result, persons’ trait scores remain partially ipsative and, thus, do not allow for valid comparisons between persons. However, we demonstrate that trait scores based on only equally keyed blocks can be improved substantially by measuring a sizable number of traits. More specifically, in our simulations of 30 traits, scores based on only equally keyed blocks were non-ipsative and highly accurate. We conclude that in high-stakes situations where persons are motivated to give fake answers, Thurstonian IRT models should only be applied to tests measuring a sizable number of traits.

APA, Harvard, Vancouver, ISO, and other styles

11

Liu, Yang, Guanyu Hu, Lei Cao, Xiaojing Wang, and Ming-Hui Chen. "Rejoinder: A comparison of Monte Carlo methods for computing marginal likelihoods of item response theory models." Journal of the Korean Statistical Society 48, no. 4 (December 2019): 522–23. http://dx.doi.org/10.1016/j.jkss.2019.07.001.

Full text

APA, Harvard, Vancouver, ISO, and other styles

12

Lim, Chae Young. "Discussion on “A comparison of Monte Carlo methods for computing marginal likelihoods of item response theory models”." Journal of the Korean Statistical Society 48, no. 4 (December 2019): 520–21. http://dx.doi.org/10.1016/j.jkss.2019.07.002.

Full text

APA, Harvard, Vancouver, ISO, and other styles

13

Alnasraween, Moen Salman, Ayat Mohammad Almughrabi, Raeda Mofid Ammari, and Mohammad Saleh Alkaramneh. "Validity and reliability of eight-grade digital culture test in light of item response theory." Cypriot Journal of Educational Sciences 16, no. 4 (August 31, 2021): 1816–35. http://dx.doi.org/10.18844/cjes.v16i4.6034.

Full text

Abstract:

The purpose of this study is to construct a digital culture test in light of the Item Response Theory and to investigate its psychometric properties. The study sample consisted of six hundred fifty (650) male and female students in the eighth grade from the Directorate of Education and Teaching of Salt District. To obtain the results, the descriptive approach was used. The results showed that the items have acceptable indicators of discrimination and extend on the continuum of difficulty adequately. The validity and reliability of the test were verified by using several methods, including content validity and internal consistency. The study findings showed that most of the test items fit the assumptions of the two-parameter logistic model. The results also displayed statistically significant differences in the arithmetic means of the digital culture test due to gender in favor of female students. Moreover, the outcomes presented statistically significant differences attributed to the education sector variable favoring the private sector. Keywords: Digital Culture Test, IRT, Psychometric Properties, Teaching.

APA, Harvard, Vancouver, ISO, and other styles

14

Harel, Daphna, and Murray Baron. "Methods for shortening patient-reported outcome measures." Statistical Methods in Medical Research 28, no. 10-11 (August 20, 2018): 2992–3011. http://dx.doi.org/10.1177/0962280218795187.

Full text

Abstract:

Patient-reported outcome measures are widely used to assess patient experiences, well-being, and treatment response in clinical trials and cohort-based observational studies. However, patients may be asked to respond to many different measures in order to provide researchers and clinicians with a wide array of information regarding their experiences. Collecting such long and cumbersome patient-reported outcome measures may burden patients, increase research costs, and potentially reduce the quality of the data collected. Nonetheless, little research has been conducted on replicable, and reproducible methods to shorten these instruments that result in shortened forms of minimal length. This manuscript proposes the use of mixed integer programming through Optimal Test Assembly as a method to shorten patient-reported outcome measures. This method is compared to the existing standard in the field, which is selecting items based on having high discrimination parameters from an item response theory model. The method is then illustrated in an application to a fatigue scale for patients with Systemic Sclerosis.

APA, Harvard, Vancouver, ISO, and other styles

15

Nagai, Takashi, Yoshikazu Ishii, Kengo Kohiyama, Takahiro Takenaka, and Takashi Yamada. "Development of the final version of the occupational adaptation questionnaire for family caregivers." Hong Kong Journal of Occupational Therapy 33, no. 2 (December 2020): 82–89. http://dx.doi.org/10.1177/1569186120983356.

Full text

Abstract:

Background Given the growing role of family caregivers in meeting the increasing demand for ageing-related care in Japan, the occupational adaptation of family caregivers needs to be evaluated. Methods This study developed the Occupational Adaptation Questionnaire for Family Caregivers (OAQC) and evaluated its reliability, validity, and effectiveness. To develop the OAQC item pool, the construct ‘Occupation of nursing care engaged in by family caregivers’ was evaluated. Using the Delphi method, 8 experts evaluated 64 items and confirmed that 41 items adequately represented the domain. The scale was then completed by 216 family caregivers of users of outpatient service establishments affiliated with hospitals in the Chubu region. The validity of the scale items was tested, and statistical analysis was performed. The construct validity and internal consistency of the OAQC were examined using exploratory factor analysis and the ω coefficient, respectively. Results The discrimination ability and difficulty level were calculated using the item response theory. The findings showed that the scale’s reliability and validity were satisfactory. Construct validity was acceptable for 16 items with 5 factors. The OAQC also had high internal consistency, reliability, and effectiveness. The item slope parameters and difficulty parameters revealed good item response, indicating that the scale could effectively measure family caregivers’ occupational adaptation. Conclusion The scale was compatible with the data obtained from family caregivers. Given its overall effectiveness, data obtained using the OAQC can contribute towards implementing personal care support programs for caregivers and enriching their lives by offering support from an occupational adaptation perspective.

APA, Harvard, Vancouver, ISO, and other styles

16

FRIES, JAMES F., DAVID CELLA, MATTHIAS ROSE, ESWAR KRISHNAN, and BONNIE BRUCE. "Progress in Assessing Physical Function in Arthritis: PROMIS Short Forms and Computerized Adaptive Testing." Journal of Rheumatology 36, no. 9 (September 2009): 2061–66. http://dx.doi.org/10.3899/jrheum.090358.

Full text

Abstract:

Objective.Assessing self-reported physical function/disability with the Health Assessment Questionnaire Disability Index (HAQ) and other instruments has become central in arthritis research. Item response theory (IRT) and computerized adaptive testing (CAT) techniques can increase reliability and statistical power. IRT-based instruments can improve measurement precision substantially over a wider range of disease severity. These modern methods were applied and the magnitude of improvement was estimated.Methods.A 199-item physical function/disability item bank was developed by distilling 1865 items to 124, including Legacy Health Assessment Questionnaire (HAQ) and Physical Function-10 items, and improving precision through qualitative and quantitative evaluation in over 21,000 subjects, which included about 1500 patients with rheumatoid arthritis and osteoarthritis. Four new instruments, (A) Patient-Reported Outcomes Measurement Information (PROMIS) HAQ, which evolved from the original (Legacy) HAQ; (B) “best” PROMIS 10; (C) 20-item static (short) forms; and (D) simulated PROMIS CAT, which sequentially selected the most informative item, were compared with the HAQ.Results.Online and mailed administration modes yielded similar item and domain scores. The HAQ and PROMIS HAQ 20-item scales yielded greater information content versus other scales in patients with more severe disease. The “best” PROMIS 20-item scale outperformed the other 20-item static forms over a broad range of 4 standard deviations. The 10-item simulated PROMIS CAT outperformed all other forms.Conclusion.Improved items and instruments yielded better information. The PROMIS HAQ is currently available and considered validated. The new PROMIS short forms, after validation, are likely to represent further improvement. CAT-based physical function/disability assessment offers superior performance over static forms of equal length.

APA, Harvard, Vancouver, ISO, and other styles

17

Seo, Dong Gi, Myeong Gi Kim, Na Hui Kim, Hye Sook Shin, and Hyun Jung Kim. "Linear programming method to construct equated item sets for the implementation of periodical computer-based testing for the Korean Medical Licensing Examination." Journal of Educational Evaluation for Health Professions 15 (October 18, 2018): 26. http://dx.doi.org/10.3352/jeehp.2018.15.26.

Full text

Abstract:

Purpose: This study aimed to identify the best way of developing equivalent item sets and to propose a stable and effective management plan for periodical licensing examinations. Methods: Five pre-equated item sets were developed based on the predicted correct answer rate of each item using linear programming. These pre-equated item sets were compared to the ones that were developed with a random item selection method based on the actual correct answer rate (ACAR) and difficulty from item response theory (IRT). The results with and without common items were also compared in the same way. ACAR and the IRT difficulty were used to determine whether there was a significant difference between the pre-equating conditions. Results: There was a statistically significant difference in IRT difficulty among the results from different pre-equated conditions. The predicted correct answer rate was divided using 2 or 3 difficulty categories, and the ACAR and IRT difficulty parameters of the 5 item sets were equally constructed. Comparing the item set conditions with and without common items, including common items did not make a significant contribution to the equating of the 5 item sets. Conclusion: This study suggested that the linear programming method is applicable to construct equated-item sets that reflect each content area. The suggested best method to construct equated item sets is to divide the predicted correct answer rate using 2 or 3 difficulty categories, regardless of common items. If pre-equated item sets are required to construct a test based on the actual data, several methods should be considered by simulation studies to determine which is optimal before administering a real test.

APA, Harvard, Vancouver, ISO, and other styles

18

Sharp, Carla, Lynne Steinberg, Jared Michonski, Allison Kalpakci, Chris Fowler, B. Christopher Frueh, and Peter Fonagy. "DSM Borderline Criterion Function Across Age-Groups: A Cross-Sectional Mixed-Method Study." Assessment 26, no. 6 (July 13, 2018): 1014–29. http://dx.doi.org/10.1177/1073191118786587.

Full text

Abstract:

DSM-5 ( Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition) Section II criteria for borderline personality disorder (BPD) lack developmental operationalization. The aim of the current study was to evaluate whether DSM criteria operate similarly across adolescents and adults to determine if developmental adjustment for DSM criteria was needed. Three age cohorts were recruited: adolescents (ages 12-17 years; n = 484), young adults (ages 18-25 years; n = 442), and adults (ages ≥26 years; n = 953). The Child Interview for DSM-IV BPD and the Structured Clinical Interview for DSM-IV Axis II disorders were administered to adolescents and adults, respectively. Item response theory methods were used to evaluate differential item (or criterion) functioning (DIF) of BPD criteria across adolescents and adults. Qualitative analyses were then used to evaluate the potential sources of DIF. Item response theory results demonstrated DIF across adolescents and adults for all DSM BPD criteria. Qualitative analyses suggested that the source of DIF was most likely due to rater/interviewer bias. Results furthermore suggested that behavioral criteria may represent the heterotypic features of BPD, while intra- and interpersonal criteria represent the homotypic features of the disorder. The article concludes with recommendations for developmentally informed guidelines for the assessment of BPD.

APA, Harvard, Vancouver, ISO, and other styles

19

She, Menghua, Yaling Li, Dongbo Tu, and Yan Cai. "Computerized Adaptive Testing for Sleep Disorders." European Journal of Health Psychology 28, no. 3 (July 2021): 89–100. http://dx.doi.org/10.1027/2512-8442/a000076.

Full text

Abstract:

Abstract. Background: As more and more people suffer from sleep disorders, the need to develop an efficient, inexpensive, and accurate assessment tool for screening sleep disorders has become more urgent. Aim: The aim of the current study was to develop a system allowing computerized adaptive testing for sleep disorders (CAT-SD). Methods: A large sample ( N = 1,304) was recruited to construct an item bank for CAT-SD and to investigate the psychometric characteristics of CAT-SD. First, analyses of unidimensionality, model fit, item fit, item discrimination parameters, and differential item functioning (DIF) were conducted to construct a final item pool to meet the requirements of item response theory measurement. Then, a simulated CAT study with real data was performed to investigate the psychometric characteristics of CAT-SD, including the reliability, validity, and predictive utility (sensitivity and specificity). Results: The final unidimensional item bank of the CAT-SD had good item fit, high discrimination, and no DIF. Moreover, it had acceptable reliability, validity, and predictive utility. Limitations: Non-statistical assembly constraints, execution environment, construction of item bank, criterion-related validity, and predictive utility (sensitivity and specificity) of CAT-SD, and sample representativeness are discussed. Conclusions: The CAT-SD could be used as an effective and accurate assessment tool for measuring the sleep disorders in individuals and offers a novel approach to the screening of sleep disorders utilizing psychological scales.

APA, Harvard, Vancouver, ISO, and other styles

20

Cúri, Mariana, and Vanessa Silva. "Academic English Proficiency Assessment Using a Computerized Adaptive Test." TEMA - Tendências em Matemática Aplicada e Computacional 20, no. 2 (July 29, 2019): 381. http://dx.doi.org/10.5540/tema.2019.020.02.381.

Full text

Abstract:

This paper describes the steps to convert a paper-and-pencil English proficiency test for academic purposes, consisting of multiple choice items administered following the Admissible Probability Measurement Procedure [24], adopted by the graduate program at the Institute of Mathematics and Computer Sciences at the University of São Paulo (ICMC-USP), Brazil, to a computerized adaptive test (CAT) based on an Item Response Theory Model (IRT). Despite the fact that the program accepts various internationally recognized tests that attest non-native speakers English proficiency, such as the Test of English as a Foreign Language (TOEFL), the International English Language Testing System (IELTS) and the Cambridge English: Proficiency (CPE), for instance, its requirement is incoherent in public universities in Brazil due to the cost, which ranges from US$ 200.00 to US$ 300.00 per exam. The TAI-PI software (Computerized Adaptive Test for English Proficiency), which was developed in Java language and SQLite, started to be used to assess the English pro?ciency of students on the program from October, 2013. The statistical methodology used was defined considering the history and aims of the test and adopted Samejima's Graded Response Model [21], the Kullback-Leibler information criterion for item selection, the a posteriori estimation method for latent trait [2] and the Shadow Test approach [29] to impose restrictions (content and test length) on the test composition of each individual. A description of the test design, the statistical methods used, and the results of a real application of TAI-PI for graduate students are presented in this paper, as well as the validation studies of the new methodology for pass or fail classification, showing the good quality of the new evaluation system and examination of improvement using the IRT and CAT methods.

APA, Harvard, Vancouver, ISO, and other styles

21

Bullock, Will, Kosuke Imai, and Jacob N. Shapiro. "Statistical Analysis of Endorsement Experiments: Measuring Support for Militant Groups in Pakistan." Political Analysis 19, no. 4 (2011): 363–84. http://dx.doi.org/10.1093/pan/mpr031.

Full text

Abstract:

Political scientists have long been interested in citizens' support level for such actors as ethnic minorities, militant groups, and authoritarian regimes. Attempts to use direct questioning in surveys, however, have largely yielded unreliable measures of these attitudes as they are contaminated by social desirability bias and high nonresponse rates. In this paper, we develop a statistical methodology to analyze endorsement experiments, which recently have been proposed as a possible solution to this measurement problem. The commonly used statistical methods are problematic because they cannot properly combine responses across multiple policy questions, the design feature of a typical endorsement experiment. We overcome this limitation by using item response theory to estimate support levels on the same scale as the ideal points of respondents. We also show how to extend our model to incorporate a hierarchical structure of data in order to uncover spatial variation of support while recouping the loss of statistical efficiency due to indirect questioning. We illustrate the proposed methodology by applying it to measure political support for Islamist militant groups in Pakistan. Simulation studies suggest that the proposed Bayesian model yields estimates with reasonable levels of bias and statistical power. Finally, we offer several practical suggestions for improving the design and analysis of endorsement experiments.

APA, Harvard, Vancouver, ISO, and other styles

22

Jamaluddine, Zeina, Nadine R. Sahyoun, Jowel Choufani, AnnieBelle J. Sassine, and Hala Ghattas. "Child-Reported Food Insecurity Is Negatively Associated with Household Food Security, Socioeconomic Status, Diet Diversity, and School Performance among Children Attending UN Relief and Works Agency for Palestine Refugees Schools in Lebanon." Journal of Nutrition 149, no. 12 (August 28, 2019): 2228–35. http://dx.doi.org/10.1093/jn/nxz189.

Full text

Abstract:

ABSTRACT Background Recent studies have shown that children are able to accurately describe their own experiences of food insecurity; however, few studies have quantified this experience. Objectives We aimed to develop and validate a child food security scale to be used in Arabic-speaking countries. Methods As part of the Healthy Kitchens Healthy Children study on food security and nutrition in schools, 14 questions were administered to 5–15-y-old children (n = 1433) attending 4 UN Relief and Works Agency for Palestine refugees schools. We used statistical methods based on Item Response Theory to assess the internal validity of a proposed food security scale using these items. Further validation of the scale was conducted by examining associations with sociodemographic, economic, household food security, and diet diversity variables through logistic regression models adjusted for age, sex, and school of the children. Results A total of 1287 children (89.8%) provided complete responses to the questionnaire. Four items were dropped from the scale because of high infit statistics (>1.3) and high residual correlation with other items. The resulting 10-item scale had acceptable reliability (Cronbach α coefficient = 0.89). Children who reported food insecurity were more likely to be from food-insecure households (OR: 2.3; P < 0.001). Food-insecure children came from households with lower expenditures per capita (OR: 0.80; P = 0.007) and higher household crowding (OR: 1.70; P = 0.001), and had less educated mothers (OR: 0.89; P = 0.009), lower household diet diversity (OR: 0.85; P < 0.001), and lower household consumption of meat (OR: 0.89; P < 0.001) and fruit (OR: 0.95; P = 0.001). Child food insecurity was also associated with higher school absenteeism (OR: 1.64; P = 0.01) and lower academic grades in languages and science (P < 0.05). Conclusions Child food security was accurately measured through a 10-item scale. This tool is appropriate to assess child food insecurity experience directly with children, and can be used in school-based interventions, where obtaining data from parents may be cumbersome.

APA, Harvard, Vancouver, ISO, and other styles

23

Paek, Insu. "A short Communication on the Article entitled “A comparison of Monte Carlo methods for computing marginal likelihoods of item response theory models”." Journal of the Korean Statistical Society 48, no. 4 (December 2019): 518–19. http://dx.doi.org/10.1016/j.jkss.2019.07.003.

Full text

APA, Harvard, Vancouver, ISO, and other styles

24

Hamel, JF, V. Sebille, T. Le Neel, G. Kubis, FC Boyer, and JB Hardouin. "What are the appropriate methods for analyzing patient-reported outcomes in randomized trials when data are missing?" Statistical Methods in Medical Research 26, no. 6 (November 6, 2015): 2897–908. http://dx.doi.org/10.1177/0962280215615158.

Full text

Abstract:

Subjective health measurements using Patient Reported Outcomes (PRO) are increasingly used in randomized trials, particularly for patient groups comparisons. Two main types of analytical strategies can be used for such data: Classical Test Theory (CTT) and Item Response Theory models (IRT). These two strategies display very similar characteristics when data are complete, but in the common case when data are missing, whether IRT or CTT would be the most appropriate remains unknown and was investigated using simulations. We simulated PRO data such as quality of life data. Missing responses to items were simulated as being completely random, depending on an observable covariate or on an unobserved latent trait. The considered CTT-based methods allowed comparing scores using complete-case analysis, personal mean imputations or multiple-imputations based on a two-way procedure. The IRT-based method was the Wald test on a Rasch model including a group covariate. The IRT-based method and the multiple-imputations-based method for CTT displayed the highest observed power and were the only unbiased method whatever the kind of missing data. Online software and Stata® modules compatibles with the innate mi impute suite are provided for performing such analyses. Traditional procedures (listwise deletion and personal mean imputations) should be avoided, due to inevitable problems of biases and lack of power.

APA, Harvard, Vancouver, ISO, and other styles

25

AGGEN, STEVEN H., MICHAEL C. NEALE, and KENNETH S. KENDLER. "DSM criteria for major depression: evaluating symptom patterns using latent-trait item response models." Psychological Medicine 35, no. 4 (December 2, 2004): 475–87. http://dx.doi.org/10.1017/s0033291704003563.

Full text

Abstract:

Background. Expert committees of clinicians have chosen diagnostic criteria for psychiatric disorders with little guidance from measurement theory or modern psychometric methods. The DSM-III-R criteria for major depression (MD) are examined to determine the degree to which latent trait item response models can extract additional useful information.Method. The dimensionality and measurement properties of the 9 DSM-III-R criteria plus duration are evaluated using dichotomous factor analysis and the Rasch and 2 parameter logistic item response models. Quantitative liability scales are compared with a binary DSM-III-R diagnostic algorithm variable to determine the ramifications of using each approach.Results. Factor and item response model results indicated the 10 MD criteria defined a reasonably coherent unidimensional scale of liability. However, person risk measurement was not optimal. Criteria thresholds were unevenly spaced leaving scale regions poorly measured. Criteria varied in discriminating levels of risk. Compared to a binary MD diagnosis, item response model (IRM) liability scales performed far better in (i) elucidating the relationship between MD symptoms and liability, (ii) predicting the personality trait of neuroticism and future depressive episodes and (iii) more precisely estimating heritability parameters.Conclusions. Criteria for MD largely defined a single dimension of disease liability although the quality of person risk measurement was less clear. The quantitative item response scales were statistically superior in predicting relevant outcomes and estimating twin model parameters. Item response models that treat symptoms as ordered indicators of risk rather than as counts towards a diagnostic threshold more fully exploit the information available in symptom endorsement data patterns.

APA, Harvard, Vancouver, ISO, and other styles

26

Aksu, Gokhan, Cigdem Reyhanlıoglu, and Mehmet Taha Eser. "Examining the Two Categorical Datas by Jmetrik, Bilog-Mg and Irtpro with Application of Mathematics Exam." European Scientific Journal, ESJ 13, no. 33 (November 30, 2017): 20. http://dx.doi.org/10.19044/esj.2017.v13n33p20.

Full text

Abstract:

The aim of this study was to examination of two-category rated mathematics course final exam based on Item Response Theory data analyzed with the help of 2-Parameter Logistic Model and determination of the ability and standard errors with the help of different programs. This study involves a comparative interpretation of some descriptive statistics and analysis. Therefore, research has characterized as relational model which is one of the general survey models. For this purpose, 771 students’ final achievement test responses to a 20-point final exam, were analyzed by BILOG, IRT PRO and JMETRİK programs. Item Response Theory assumptions were analyzed with SPSS and Factor 9.3 programs. Working as a result of the analysis of data all of the IRT assumptions are met and the most appropriate model of data set has been concluded that the twoparameter logistic model. The study also found that there is a statistically significant relationship between the estimated parameters related to individual ability and error at the level of .01. Especially compared to the others there is also significant relationship between JMETRİK and IRT PRO. Different models and methods of research proposals have been made in terms of response patterns to be analyzed a gain for the same data set.

APA, Harvard, Vancouver, ISO, and other styles

27

Zamzuri, Mohd ‘Ammar Ihsan Ahmad, Mohd Nazrin Jamhari, Hasanain Faisal Ghazi, Muhamad Hazizi Muhamad Hasani, Noor Khalili Mohd Ali, Mohammad Faid Abd. Rashid, Rozita Hod, Rahmat Dapari, and Mohd Rohaizat Hassan. "A unique double tango: Construct validation and reliability analysis of risk perception, attitude and practice (RPAP) questionnaire on dengue infection." PLOS ONE 16, no. 8 (August 24, 2021): e0256636. http://dx.doi.org/10.1371/journal.pone.0256636.

Full text

Abstract:

Introduction/background Dengue fever remains a public health threat despite being preventable. A solution to the constant problem of dengue infection will require active intervention and a paradigm shift. Assessing perceived risk and correlating it with the attitude and practice of the community will help in designing appropriate measures. However, possible instruments for these assessments come with limitations. Objective The aim is to develop and validate a new scoring-based questionnaire, using dual statistical approaches to measure risk perception, attitude, and practices (RPAP) related to dengue in the community. Methods The RPAP questionnaire was developed bilingually using the International Society for Pharmacoeconomics and Outcome Research (ISPOR) guidelines. Content analysis was reviewed scrupulously by four expert panels. The initial 35-item scale was tested among 253 Malaysian respondents recruited non-probabilistically via multiple online platforms. Two statistical methods were employed to measure the construct validity: Exploratory Factor Analysis (EFA) as part of the Classical Test Theory (CTT) measurement, while Rasch Measurement Analysis (Rasch) was performed for the Item Response Theory (IRT) measurement. All results were cross-validated with their counterpart to ensure stability. Confirmatory Factor Analysis (CFA) was used to obtain a model fit index. Results 29 questions were retained after the final analysis. Both EFA and Rasch analysis detect multidimensionality. Nine latent factors were extracted from EFA, while only eight factors remained in the final model following CFA: 1) perceived susceptibility; 2) perceived severity; 3) perceived barrier; 4) perceived benefit; 5) cues to action; 6) self-efficacy; 7) attitude; and 8) practice. All items had adequate factor loadings and showed good internal consistency. The final model after CFA achieved a good fit with an RMSEA value of 0.061, SRMR of 0.068, PNFI of 0.649, and GFI of 0.996. Conclusion The RPAP questionnaire contains 29 items and is a reliable and accurate psychometric instrument for measuring the risk perception of dengue fever, attitude, and practice of the community in dengue prevention. The Rasch measurement provides additional rigour to complement the CTT analysis. This RPAP questionnaire is suitable for use in studies related to dengue prevention in the community.

APA, Harvard, Vancouver, ISO, and other styles

28

Brambor, Thomas, Agustín Goenaga, Johannes Lindvall, and Jan Teorell. "The Lay of the Land: Information Capacity and the Modern State." Comparative Political Studies 53, no. 2 (May 27, 2019): 175–213. http://dx.doi.org/10.1177/0010414019843432.

Full text

Abstract:

This article presents new evidence on the efforts of states to collect and process information about themselves, their territories, and their populations. We compile data on five institutions and policies: the regular implementation of a reliable census, the regular release of statistical yearbooks, the introduction of civil and population registers, and the establishment of a government agency tasked with processing statistical information. Using item response theory methods, we generate an index of “information capacity” for 85 states from 1789 to the present. We then ask how political regime changes have influenced the development of information capacity over time. In contrast with the literature on democracy and fiscal capacity, we find that suffrage expansions are associated with higher information capacity, but increases in the level of political competition are not. These findings demonstrate the value of our new measure, because they suggest that different elements of state capacity are shaped by different historical processes.

APA, Harvard, Vancouver, ISO, and other styles

29

Guo, Wenyue, Haiyan Liu, Anzhu Yu, and Jing Li. "RESEARCH ON VISUAL ANALYSIS METHODS OF TERRORISM EVENTS." ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XLI-B2 (June 7, 2016): 191–96. http://dx.doi.org/10.5194/isprsarchives-xli-b2-191-2016.

Full text

Abstract:

Under the situation that terrorism events occur more and more frequency throughout the world, improving the response capability of social security incidents has become an important aspect to test governments govern ability. Visual analysis has become an important method of event analysing for its advantage of intuitive and effective. To analyse events’ spatio-temporal distribution characteristics, correlations among event items and the development trend, terrorism event’s spatio-temporal characteristics are discussed. Suitable event data table structure based on “5W” theory is designed. Then, six types of visual analysis are purposed, and how to use thematic map and statistical charts to realize visual analysis on terrorism events is studied. Finally, experiments have been carried out by using the data provided by Global Terrorism Database, and the results of experiments proves the availability of the methods.

APA, Harvard, Vancouver, ISO, and other styles

30

Guo, Wenyue, Haiyan Liu, Anzhu Yu, and Jing Li. "RESEARCH ON VISUAL ANALYSIS METHODS OF TERRORISM EVENTS." ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XLI-B2 (June 7, 2016): 191–96. http://dx.doi.org/10.5194/isprs-archives-xli-b2-191-2016.

Full text

Abstract:

Under the situation that terrorism events occur more and more frequency throughout the world, improving the response capability of social security incidents has become an important aspect to test governments govern ability. Visual analysis has become an important method of event analysing for its advantage of intuitive and effective. To analyse events’ spatio-temporal distribution characteristics, correlations among event items and the development trend, terrorism event’s spatio-temporal characteristics are discussed. Suitable event data table structure based on “5W” theory is designed. Then, six types of visual analysis are purposed, and how to use thematic map and statistical charts to realize visual analysis on terrorism events is studied. Finally, experiments have been carried out by using the data provided by Global Terrorism Database, and the results of experiments proves the availability of the methods.

APA, Harvard, Vancouver, ISO, and other styles

31

García-Pérez, Miguel A. "The Decade 1989–1998 in Spanish Psychology: An Analysis of Research in Statistics, Methodology, and Psychometric Theory." Spanish Journal of Psychology 4, no. 2 (November 2001): 111–22. http://dx.doi.org/10.1017/s1138741600005710.

Full text

Abstract:

This paper presents an analysis of research published in the decade 1989–1998 by Spanish faculty members in the areas of statistical methods, research methodology, and psychometric theory. Database search and direct correspondence with faculty members in Departments of Methodology across Spain rendered a list of 193 papers published in these broad areas by 82 faculty members. These and other faculty members had actually published 931 papers over the decade of analysis, but 738 of them addressed topics not appropriate for description in this report. Classification and analysis of these 193 papers revealed topics that have attracted the most interest (psychophysics, item response theory, analysis of variance, sequential analysis, and meta-analysis) as well as other topics that have received less attention (scaling, factor analysis, time series, and structural models). A significant number of papers also dealt with various methodological issues (software, algorithms, instrumentation, and techniques). A substantial part of this report is devoted to describing the issues addressed across these 193 papers–most of which are written in the Spanish language and published in Spanish journals–and some representative references are given.

APA, Harvard, Vancouver, ISO, and other styles

32

Waters, Erika A., Nicole Ackermann, and Courtney S. Wheeler. "Specifying Future Behavior When Assessing Risk Perceptions: Implications for Measurement and Theory." Medical Decision Making 39, no. 8 (October 24, 2019): 986–97. http://dx.doi.org/10.1177/0272989x19879704.

Full text

Abstract:

Background. Many theories assert that high perceived risk motivates health behavior change; the empirical literature shows mixed findings. Purpose. To determine whether, for whom, and under what circumstances specifying a future behavior when assessing perceived risk (i.e., “conditioning” risk perception items on behavior) improves data quality and strengthens the perceived risk-intentions/behavior relationship. Methods. Internet panel participants ( N = 787, 58.8% no college experience, 44.4% racial/ethnic minority, 43.7% men, 67.3% aged 18–49 years, 59.0% nonadherent to physical activity guidelines) answered 8 colon cancer perceived risk items in a within-subjects design. Participants answered 4 types of risk perception items: absolute and comparative perceived likelihood and absolute and comparative feelings of risk. Participants answered each type of item twice: once conditioned on not engaging in physical activity and once unconditioned. Results. Compared to unconditioned items, conditioned items elicited fewer “don’t know” (DK) responses (OR = 0.80; 95% CI, 0.68–0.93), higher risk perceptions ( b = 0.55; 95% CI, 0.49–0.61) and stronger positive correlations with intentions ( zSteiger = 5.46, P < 0.001) and behavior ( zSteiger = 5.10, P < 0.001). The effect of conditioning was more pronounced for perceived likelihood than feelings of risk items (OR = 2.21; 95% CI, 1.63–3.01 and b = 0.14; 95% CI, 0.08–0.20 for DK responding and risk perception magnitude, respectively). The effect on risk perception magnitude (except absolute feelings of risk) was higher among people with higher health literacy (χ2(3) = 8.11, P = 0.04). Conclusions. Researchers who examine whether perceived risk motivates precautionary behavior should consider conditioning risk perception items on behavior to increase the validity of the statistical conclusions they draw and to gain insight into the nature of perceived risk and its relation to behavior.

APA, Harvard, Vancouver, ISO, and other styles

33

Rajki, Veronika, Tibor Deutsch, Mária Csóka, and Judit Mészáros. "Modern tesztelméleti eszközök alkalmazása az ápolók transzfúziós ismereteinek objektív mérésében." Orvosi Hetilap 156, no. 37 (September 2015): 1497–508. http://dx.doi.org/10.1556/650.2015.30242.

Full text

Abstract:

Introduction: Clinical practice requires knowledge of health professionals for the application of problem solving of different tasks in various clinical settings. Therefore, a set of reliable measurement tools is required to assess various components of the professional knowledge including factual knowledge, skills and competence as outcomes of nursing education and training. Traditional questionnaires and test methods do not allow the measurement of these characteristics properly, as these tools typically provide overall scores without relevant insight into areas in which nurses perform well, and those where their knowledge and/or skills should be improved substantially. Aim: The aim of this nationwide survey conducted among nurses between November 2014 and February 2015 was to determine how the teaching/training objectives have been achieved in the nurses’ transfusion practice. The authors attempted to exceed the capabilities of classical test theory and acquire a detailed picture about what the nurses know about transfusion therapy and how they are involved and behave in routine clinical practice. Method: The knowledge and skills of 657 participants were assessed using a validated instrument consisting of a set of questions covering every aspects of transfusion therapy. The answers to these items were evaluated on a binary (good or bad) scale. Recorded answers of the participants were analysed using hierarchical cluster analysis and item response theory tools such as the one-parametric Rasch model suitable for dichotomous data. Data analysis was performed with the SPSS program and the ltm module of the R statistical program. Results: The paper presents the distribution of correct and incorrect answers to various questions about transfusion therapy along with the corresponding logit values and odds ratios, respectively. The characteristic curves of each item were determined on the basis of the number of correct answers that have been recorded. These curves highlight which questions were answered easily and which items were found harder to answer by the nurses who participated in the survey. In addition to the separate analysis of individual questions, a set of response patterns is also presented which shows how frequently the nurses responded correctly to different combinations (sub-sets) of questions. On the whole, nurses exhibited medium level performance in terms of knowledge and skills required for efficient and safe transfusion practice. Conclusions: Objective and reliable measurement of the level of acquired knowledge is a key requirement in nursing education. This paper, which demonstrates the use of cluster analysis and item response theory for the assessment of transfusion-related knowledge of nurses, focuses on this issue for the first time in nursing research. The results of this survey have revealed substantial limitations and deficiencies both in knowledge and skills of nurses which need to be addressed by training in order to improve the efficiency and safety of transfusion therapy. Orv. Hetil., 2015, 156(37), 1497–1508.

APA, Harvard, Vancouver, ISO, and other styles

34

von Davier, Matthias. "Detecting and treating errors in tests and surveys." Quality Assurance in Education 26, no. 2 (April 3, 2018): 243–62. http://dx.doi.org/10.1108/qae-07-2017-0036.

Full text

Abstract:

Purpose Surveys that include skill measures may suffer from additional sources of error compared to those containing questionnaires alone. Examples are distractions such as noise or interruptions of testing sessions, as well as fatigue or lack of motivation to succeed. This paper aims to provide a review of statistical tools based on latent variable modeling approaches extended by explanatory variables that allow detection of survey errors in skill surveys. Design/methodology/approach This paper reviews psychometric methods for detecting sources of error in cognitive assessments and questionnaires. Aside from traditional item responses, new sources of data in computer-based assessment are available – timing data from the Programme for the International Assessment of Adult Competencies (PIAAC) and data from questionnaires – to help detect survey errors. Findings Some unexpected results are reported. Respondents who tend to use response sets have lower expected values on PIAAC literacy scales, even after controlling for scores on the skill-use scale that was used to derive the response tendency. Originality/value The use of new sources of data, such as timing and log-file or process data information, provides new avenues to detect response errors. It demonstrates that large data collections need to better utilize available information and that integration of assessment, modeling and substantive theory needs to be taken more seriously.

APA, Harvard, Vancouver, ISO, and other styles

35

Annetta, Leonard, Richard Lamb, Denise M. Bressler, and David B. Vallett. "Cognitive Modeling of Learning Using Big Data From a Science-Based Game Development Environment." International Journal of Game-Based Learning 10, no. 4 (October 2020): 22–39. http://dx.doi.org/10.4018/ijgbl.2020100102.

Full text

Abstract:

The purpose of this study was to identify the underlying cognitive attributes used during the design and development of science-based serious educational games. Study methods rely on a modification of cognitive diagnostics, item response theory, and Bayesian estimation with traditional statistical techniques such as factor analysis and model fit analysis to examine the data and model structure. A computational model of the cognitive processing using an artificial neural network (ANN) allowed for examination of underlying mechanisms of cognition from a server-side data set and a 21st century skills assessment. ANN results indicate that the model correctly predicts successful completion of science-based serious educational game (SEG) design tasks related to 21st century skills 86% of the time and correctly predicts failure to complete SEG design tasks related to 21st century skills 78% of the time. The model also reveals the relative importance of each particular cognitive attribute within the 21st century skills framework.

APA, Harvard, Vancouver, ISO, and other styles

36

Martínez-Guerreroa, José I. "A critical review of antecedents of psychological measurement: Is it necessary to revisit or reorganized the foundations Psychometry?" Journal of Basic and Applied Psychology Research 3, no. 5 (July 5, 2021): 1–10. http://dx.doi.org/10.29057/jbapr.v3i5.6774.

Full text

Abstract:

The purpose of present essay was revisited and supplemented antecedents on psychological measurement and analyzed his foundations in a scientific perspective. This review includes since early great pioneers of measurement at the beginning of nineteen century to the principal contributions of psychometrics in the middle of twenty century. The researcher’ knowledge on the Science History in general and the analysis of theories and methodological antecedents of their discipline are very important to an integral scientific’ formation. The problems and challenges of science are showed an interdisciplinary approach that permits the comprehension of scientific concepts and methods, to improve and reorganized each discipline with a broad and new perspective. The roots of Psychometry included pioneers and contributors in Mathematical, Statistical, and Experimental Sciences, who’s had persisted with creativity in the development of theories, methods, models, technics and procedures to construct psychological measurements in social and behavioral sciences. Since the Rasch’ Model and the Item Response Theory, among others advances in multivariate statistics, the modern Psychometry have theories, models, applied strategies and methods to document validity and reliability evidence. Psychometrics today, counts with standards to guide best practices in test development and validation in behavioral sciences.

APA, Harvard, Vancouver, ISO, and other styles

37

Schreck, Christopher J., Mark T. Berg, Bonnie S. Fisher, and Pamela Wilcox. "That Door You Just Kicked in Was Locked for Your Protection, Not Mine." Journal of Research in Crime and Delinquency 55, no. 2 (January 3, 2018): 316–45. http://dx.doi.org/10.1177/0022427817751570.

Full text

Abstract:

Objectives: This research investigates tendencies for individuals to preference adopting weaponry to protect their home over unarmed defensive measures such as installing a lock or alarm. We extend the subculture of violence perspective to account for specific choices and test this approach against hypotheses related to situational reactions to threat. Methods: To explore differential preferences in crime prevention choices for protecting the home, we use data from 1,961 Seattle adults, collected during 2002 to 2003. We employ Osgood and Schreck’s multilevel item response theory–based method as our statistical approach. Results: The results indicate that those who endorse the values of the subculture of violence are more likely to have defensive weaponry among their countermeasures against crime, although the results also show that those who believe the police act justly are more likely to procure weapons. Situational reactions to threat also influenced choices, though not always in the predicted direction. Conclusions: Beliefs may be an important determinant for how people protect themselves against crime. Further, criminological theory can successfully explain crime prevention choices in the general population, indicating considerable untapped future directions for research.

APA, Harvard, Vancouver, ISO, and other styles

38

Xu, Ting, and Clement A. Stone. "Using IRT Trait Estimates Versus Summated Scores in Predicting Outcomes." Educational and Psychological Measurement 72, no. 3 (October 11, 2011): 453–68. http://dx.doi.org/10.1177/0013164411419846.

Full text

Abstract:

It has been argued that item response theory trait estimates should be used in analyses rather than number right (NR) or summated scale (SS) scores. Thissen and Orlando postulated that IRT scaling tends to produce trait estimates that are linearly related to the underlying trait being measured. Therefore, IRT trait estimates can be more useful than summated scores when examining relationships between test scores and external variables. Also, when the model holds, IRT trait estimates possess an interval scale that is a property assumed for dependent variables by most statistical procedures used in educational research. The objective of this study was to use Monte Carlo methods to compare the performance of IRT trait estimates and SS scores in predicting outcome variables in the context of health and behavioral assessment. The use of scores based on the graded-response model versus summated scores was compared. Results indicated that IRT-based scores and summated scores are comparable when evaluating the relationships between test scores and outcome measures. Thus, applied researchers could use summated scores in predictive studies and circumvent evaluating the assumptions underlying use of IRT-based scores.

APA, Harvard, Vancouver, ISO, and other styles

39

Chatfield, Sheryl L., Abigail Gamble, and Jeffrey S. Hallam. "Men’s Preferences for Physical Activity Interventions: An Exploratory Study Using a Factorial Survey Design Created With R Software." American Journal of Men's Health 12, no. 2 (April 8, 2016): 347–58. http://dx.doi.org/10.1177/1557988316643316.

Full text

Abstract:

Effective exercise interventions are needed to improve quality of life and decrease the impact of chronic disease. Researchers suggest males have been underrepresented in exercise intervention studies, resulting in less understanding of their exercise practices. Findings from preference survey methods suggest reasonable association between preference and behavior. The purpose of the research described in this article was to use factorial survey, a preference method, to identify the characteristics of exercise interventions most likely to appeal to male participants, so preferences might be incorporated into future intervention research. The research was guided by the framework of Bandura’s social cognitive theory, such that variations in individual, environmental, and behavioral factors were incorporated into vignettes. Participants included 53 adult male nonadministrative staff and contract employees at a public university in the Southeastern United States, who each scored 8 vignettes resulting in 423 observations. Multilevel models were used to assess the influence of the factors. Participants scored vignettes that included exercising with a single partner, playing basketball, and exercising in the evening higher than vignettes with other options. Qualitative analysis of an open response item identified additional alternatives in group size, participant desire for coaching support, and interest in programs that incorporate a range of activity alternatives. Findings from this research were consistent with elements of social cognitive theory as applied to health promotion. Factorial surveys potentially provide a resource effective means of identifying participants’ preferences for use when planning interventions. The addition of a single qualitative item helped clarify and expand findings from statistical analysis.

APA, Harvard, Vancouver, ISO, and other styles

40

Kukharenko, V. M., L. P. Perkhun, and N. M. Tovmachenko. "The Method for Comprehensive Quality Evaluation of Tests. Part 1." Statistics of Ukraine 82, no. 3 (September 4, 2018): 40–48. http://dx.doi.org/10.31767/su.3(82)2018.03.04.

Full text

Abstract:

Informatization of the modern society has led to the wide-scale and rapid introduction of distance training technologies in virtually all the categories of Ukrainian HEEs. Studies of the aspects related to applications of digital technologies in education processes are subject to close attention in Ukraine and beyond. An important component of the training process is test control of knowledge. Education activities at the National Academy of Statistics, Accounting and Audit rely on criteria oriented tests. They are realized on the basis of distance training system Moodle that allows for creating test questions of various types and their repeated inclusion in various packages of test tasks. The Moodle environment allows for computation of selected statistical indicators on a fulfilled test and its individual tasks: average estimate and median, standard deviation, asymmetry, excess, internal agreement rate, standard error etc. However, these characteristics are not enough for justified acceptance of test results. The article presents the first phase in elaborating a comprehensive method for quality evaluation of selected test tasks and the test as a whole. This method combining the classical theory, Data Mining and Item Response theory methods involves six steps. The first step, based on indicators of descriptive statistics, allows for evaluating the obtained distribution of test results. The second step involves evaluation of the validity of test tasks. The point-bead ratio is computed to derive the correlation between individual test task and individual test score of a student, with values higher than 0.5 considered satisfactory. Pearson correlation coefficient for binary variables shows the correlation between pairs of test tasks. The test tasks with negative correlations with the other test tasks are not considered as valid, and they have to be corrected or replaced. At the third step, the factor validity of the test is evaluated. The test tasks combined in groups using factor analysis methods are subject to further analysis to determine their impact on the final result, the individual test score of a student. All the above mentioned steps are illustrated by example. The computation is made by SPSS software package. The difference in interpretation of the computation results in each step for norm oriented and criteria oriented tests is demonstrated. The description of further steps involves in the method for comprehensive quality evaluation of tests, which use Data Mining and Item Response Theory methods, will be continued in next publications.

APA, Harvard, Vancouver, ISO, and other styles

41

Panaretos, Dimitris, Efi Koloverou, Alexandros C. Dimopoulos, Georgia-Maria Kouli, Malvina Vamvakari, George Tzavelas, Christos Pitsavos, and Demosthenes B. Panagiotakos. "A comparison of statistical and machine-learning techniques in evaluating the association between dietary patterns and 10-year cardiometabolic risk (2002–2012): the ATTICA study." British Journal of Nutrition 120, no. 3 (May 23, 2018): 326–34. http://dx.doi.org/10.1017/s0007114518001150.

Full text

Abstract:

AbstractStatistical methods are usually applied in examining diet–disease associations, whereas factor analysis is commonly used for dietary pattern recognition. Recently, machine learning (ML) has been also proposed as an alternative technique in health classification. In this work, the predictive accuracy of statistical v. ML methodologies as regards the association of dietary patterns on CVD risk was tested. During 2001–2002, 3042 men and women (45 (sd 14) years) were enrolled in the ATTICA study. In 2011–2012, the 10-year CVD follow-up was performed among 2020 participants. Item Response Theory was applied to create a metric of combined 10-year cardiometabolic risk, the ‘Cardiometabolic Health Score’, that incorporated incidence of CVD, diabetes, hypertension and hypercholesterolaemia. Factor analysis was performed to extract dietary patterns, on the basis of either foods or nutrients consumed; linear regression analysis was used to assess their association with the cardiometabolic score. Two ML techniques (k-nearest-neighbor’s algorithm and random-forests decision tree) were applied to evaluate participants’ health based on dietary information. Factor analysis revealed five and three factors from foods and nutrients, respectively, explaining 54 and 65 % of the total variation in intake. Nutrient and food pattern regression models showed similar accuracy in correctly classifying an individual according to the cardiometabolic risk (R2=9·6 % and R2=8·3 %, respectively). ML techniques were superior compared with linear regression in correct classification of the individuals according to the Health Score (accuracy approximately 38 v. 6 %, respectively), whereas the two ML methods showed equal classification ability. Conclusively, ML methods could be a valuable tool in the field of nutritional epidemiology, leading to more accurate disease-risk evaluation.

APA, Harvard, Vancouver, ISO, and other styles

42

Gries, Katharine S., John Fastenau, Charlie Iaconangelo, and Daniel Serrano. "Measuring Patient Reported Outcomes in Multiple Myeloma: Are Legacy Instruments Fit for Purpose." Blood 132, Supplement 1 (November 29, 2018): 4862. http://dx.doi.org/10.1182/blood-2018-99-117223.

Full text

Abstract:

Abstract Introduction: Measuring patient-reported outcomes (PROs) in oncology clinical trials is important to capture the patient perspective on quality of survival and the outcomes of health-related quality of life (HRQoL), symptoms, and functional status. In multiple myeloma (MM) randomized clinical trials (RCTs) the common PRO instruments include the EORTC QLQ-C30 and the EQ-5D as generic measures of a patient's health status. While these legacy PRO instruments have been widely used in the MM population, it is important to continue to assess the instruments psychometric properties for the specific context of use and target population. The purpose of this research was to investigate the strengths and limitations of the PRO instruments when analyzed free from developer construct and scoring recommendations to assess fit for purpose in various MM clinical trial populations. Methods: Data for the analysis included 3 MM RCTs (n=1,773) (Dimopoulos 2016, Palumbo 2016, Mateos 2018). The PRO analysis set consisted of 33 items from the EORTC QLQ-C30 and EQ-5D-5L collected at baseline and during the treatment phase of the RCTs. Item analysis and modern psychometric methods were applied to the PRO item pool as a 4-step process: (1) descriptive analysis of the items, (2) assessment of factor structure, (3) performance of factors across RCT populations (treatment naive, relapsed/refractory MM), and (4) assessment of meaningful change. Items were characterized using polychoric correlations to examine the inter-item association and relationships before determining appropriate factor structures. An exploratory factor analysis (EFA) followed by multidimensional item response theory (MIRT) and bifactor IRT models were conducted to determine how items fit into factors and to test a total symptom and total impact scoring option. Differential item functioning (DIF) was conducted to evaluation item and domain performance across study populations (treatment naïve vs. relapsed/refractory) and longitudinally (baseline vs. week 24). Meaningful change in these new domain constructs was evaluated using anchor-based techniques based on the EORTC-QLQ-C30 and EQ-5D-5L global items. A novel provisional latent class model approach creating responder groups was also developed and applied to assess meaningful change. Results: The 33 items from the EORTC QLQ-C30 and EQ-5D-5L demonstrated strong inter-item correlations supportive of pre-specified domain structures and consistent across RCT study populations. The EFA supported 3-domain structures for both symptoms and impacts separately. These domains were broadly consistent with the PRO instrument developer specifications but unique in several regards. These domains were statistically identical across RCTs. Model fit showed that a proposed bi-factor IRT (Symptom domain and Impact domain) fit better than the MIRT suggesting that subdomains are less useful than total domain scores. An extensive DIF analysis within this IRT framework found no evidence of substantial DIF effects. All evidence indicated that the items performed similarly across sample populations and time points. Anchor-based meaningful change calculations for the newly created total symptom score and the total impact score demonstrated separation between global health anchor groups but the meaningful change estimates were small. The latent class models demonstrated that treatment arms differed significant in responder rates within 1 RCT. Conclusions: The analyses confirmed psychometrically robust EORTC QLQ-C30 and EQ-5D-5L items in the MM population under newly specified factor structures. In addition, performance of the items allows for calculation of a total symptom score and total impacts score. The newly developed total domains performed identically across MM populations (treatment naïve versus relapsed/refractory). The results only focus on the on-treatment phase of the RCT so future research is needed to determine how psychometric methods can be adjusted to better accommodate survival status and censoring for progression in oncology RCTs. The psychometric properties and consistency of the findings across the newly created symptom and impact total domain structures support the results that daratumumab maintains HRQoL in patients with MM. Disclosures Gries: Janssen Research & Development, LLC: Employment. Fastenau:Janssen Research & Development, LLC: Employment. Iaconangelo:Janssen Research &Development, LLC: Consultancy. Serrano:Janssen Research & Development, LLC: Consultancy.

APA, Harvard, Vancouver, ISO, and other styles

43

Scalise, Kathleen. "Hybrid Measurement Models for Technology-Enhanced Assessments Through mIRT-bayes." International Journal of Statistics and Probability 6, no. 3 (May 14, 2017): 168. http://dx.doi.org/10.5539/ijsp.v6n3p168.

Full text

Abstract:

Technology-enhanced assessments (TEAs) are rapidly emerging in educational measurement. In contexts such as simulation and gaming, a common challenge is handling complex streams of information, for which new statistical innovations are needed that can provide high quality proficiency estimates for the psychometrics of complex TEAs. Often in educational assessments with formal measurement models, latent variable models such as item response theory (IRT) are used to generate proficiency estimates from evidence elicited. Such robust techniques have become a foundation of educational assessment, when models fit. Another less common approach to compile evidence is through Bayesian networks, which represent a set of random variables and their conditional dependencies via a directed acyclic graph. Network approaches can be much more flexibly designed for complex assessment tasks and are often preferred by task developers, for technology-enhanced settings. However, the Bayesian network-based statistical models often are difficult to validate and to gauge the stability and accuracy, since the models make assumptions regarding conditional dependencies that are difficult to test. Here a new measurement model family, mIRT-bayes, is proposed to gain advantages of both latent variable models and network techniques combined through hybridization. Specifically, the technique described here embeds small Bayesian networks within an overarching multidimensional IRT model (mIRT), preserving the flexibility for task design while retaining the robust statistical properties of latent variable methods. Applied to simulation-based data from Harvard's Virtual Performance Assessments (VPA), the results of the new model show acceptable fit for the overarching mIRT model, along with reduction of the standard error of measurement through the embedded Bayesian networks, compared to use of mIRT alone. Overall for respondents, a finer grain-size of inference is made possible without additiona testing time or scoring resources, showing potentially promise for this family of new hybrid models.

APA, Harvard, Vancouver, ISO, and other styles

44

Gomes, Diego Eller, José Luís Guedes Dos Santos, José Wicto Pereira Borges, Murilo Pedroso Alves, Dalton Francisco De Andrade, and Alacoque Lorenzini Erdmann. "Teoria da resposta ao item nas pesquisas em saúde pública." Revista de Enfermagem UFPE on line 12, no. 6 (June 2, 2018): 1800. http://dx.doi.org/10.5205/1981-8963-v12i6a234740p1800-1812-2018.

Full text

Abstract:

RESUMOObjetivo: analisar a utilização da Teoria da Resposta ao Item (TRI) nas pesquisas em saúde pública. Método: revisão integrativa, sem delimitação temporal, nas bases de dados LILACS, SCOPUS, MEDLINE/PUBMED e na biblioteca virtual SCIELO. Adotaram-se as etapas de leitura, contagem de ocorrência de características das publicações e organização em categorias temáticas no processo de análise. Resultados: 56 estudos atenderam aos critérios de inclusão que buscaram avaliar a validade e a confiabilidade de instrumentos de mensuração e realizar ajustes psicométricos. Destacaram-se, como traços latentes mais avaliados, a qualidade de vida, a saúde de idosos e o letramento em saúde. Conclusão: evidenciam-se as possibilidades de utilização da TRI para a avaliação de uma diversidade de traços antes relacionados à saúde pública ao permitir a transformação de resultados subjetivos em probabilidades. O desenvolvimento de novos instrumentos pode considerar a avaliação de traços latentes relevantes para a saúde pública, como a integralidade da assistência na atenção básica, o vínculo do usuário aos serviços de saúde e perspectivas sobre vulnerabilidades, contribuindo para o avanço do conhecimento científico. Descritores: Métodos e Procedimentos Estatísticos; Inquéritos e Questionários; Análise Quantitativa; Saúde Pública; Enfermagem; Avaliação em Enfermagem.ABSTRACTObjective: to analyze the use of Item Response Theory (IRT) in public health research. Method: integrative review, without temporal delimitation, in the databases LILACS, SCOPUS, MEDLINE / PUBMED and in the SCIELO virtual library. We adopted the steps of reading, counting of occurrence of characteristics of the publications and organization in thematic categories in the process of analysis. Results:56 studies met the inclusion criteria that sought to evaluate the validity and reliability of measurement instruments and to make psychometric adjustments. The quality of life, the health of the elderly and the health literacy were highlighted as more evaluated latent traits. Conclusion: the possibilities of using IRT for the evaluation of a diversity of traits previously related to public health are evidenced by allowing the transformation of subjective results into probabilities. The development of new instruments may consider the evaluation of latent traits relevant to public health, such as the integrality of care in basic care, the user's link to health services and perspectives on vulnerabilities, contributing to the advancement of scientific knowledge. Descriptors: Statistical Methods and Procedures; Surveys and Questionnaires; Quantitative Analysis; Public Health; Nursing; Nursing Assessment.RESUMENObjetivo: analizar la utilización de la Teoría de la Respuesta al Ítem (TRI) en las investigaciones en salud pública. Método: revisión integrativa, sin delimitación temporal, en las bases de datos LILACS, SCOPUS, MEDLINE / PUBMED y en la biblioteca virtual SCIELO. Se adoptaron las etapas de lectura, conteo de ocurrencia de características de las publicaciones y organización en categorías temáticas en el proceso de análisis. Resultados: 56 estudios atendieron a los criterios de inclusión, que buscaron evaluar la validez y la confiabilidad de instrumentos de medición y realizar ajustes psicométricos. Se destacaron como rasgos latentes más evaluados, la calidad de vida, la salud de ancianos y el letramiento en salud. Conclusión: se evidencian las posibilidades de utilización de la TRI para la evaluación de una diversidad de rasgos latentes relacionados a la salud pública, al permitir la transformación de resultados subjetivos en probabilidades. El desarrollo de nuevos instrumentos puede considerar la evaluación de rasgos latentes relevantes para la salud pública, como la integralidad de la asistencia en la atención básica, el vínculo del usuario a los servicios de salud y perspectivas sobre vulnerabilidades, contribuyendo al avance del conocimiento científico. Descriptores: Métodos y Procedimientos Estadísticos; Encuestas y Questionarios; Análisis Cuantitativo; Salud Pública; Enfermería; Evaluación em Enfermería.

APA, Harvard, Vancouver, ISO, and other styles

45

Acaster, S., P. Swinburn, C. Wang, B. Stemper, K. Beckmann, V. Knappertz, C. Pohl, et al. "Can the functional assessment of multiple sclerosis adapt to changing needs? A psychometric validation in patients with clinically isolated syndrome and early relapsing–remitting multiple sclerosis." Multiple Sclerosis Journal 17, no. 12 (July 14, 2011): 1504–13. http://dx.doi.org/10.1177/1352458511414039.

Full text

Abstract:

Background: The Functional Assessment of Multiple Sclerosis (FAMS) is widely used in clinical trial programmes; however, it was developed before the rise in trials targeted at early stage multiple sclerosis (MS) and clinically isolated syndrome (CIS). Objective: The aim of this study was to assess the psychometric properties of the FAMS within two clinically distinct populations, CIS and early relapsing–remitting MS (RRMS), and discern the appropriateness of the FAMS within these populations. Methods: Secondary analysis was conducted on FAMS data from two clinical trials assessing interferon beta–1b in early RRMS and CIS. The statistical analysis assessed the scale acceptability, reliability, validity and responsiveness of the FAMS. Item response theory (IRT) was also conducted on the early RRMS sample in order to assess how well the FAMS discriminated amongst individuals with less severe MS. Results: Results from both trials demonstrated an improvement in the FAMS psychometric properties with increased baseline disease severity. However, high ceiling effects were evident amongst less severe patients, and there was an overall lack of responsiveness to improvement and poor construct validity. IRT also demonstrated its lack of discrimination/sensitivity in early RRMS. Conclusions: In trials involving patients with early stage RRMS and CIS, modifications to the FAMS based on a qualitative assessment of its content validity in these populations would be required in order to potentially improve the FAMS psychometric properties and sensitivity.

APA, Harvard, Vancouver, ISO, and other styles

46

Shitu, Kegnie, Mekuriaw Alemayehu, Yvonne A. B. Buunk-Werkhoven, and Simegnew Handebo. "Determinants of intention to improve oral hygiene behavior among students based on the theory of planned behavior: A structural equation modelling analysis." PLOS ONE 16, no. 2 (February 25, 2021): e0247069. http://dx.doi.org/10.1371/journal.pone.0247069.

Full text

Abstract:

Introduction The prevalence of oral hygiene behaviors (OHB) is very low among school children in Ethiopia. However, the determinants of student’s readiness/intention to perform those behaviors have been remained unstudied. Objective This study aimed to identify the determinants of oral hygiene behavioral intention (OHBI) among preparatory school students based on the theory of planned behavior (TPB). Methods and materials An institution-based cross-sectional study was conducted among 393 students. A 98-item self-administered questionnaire was used to evaluate oral hygiene knowledge (OHK), oral hygiene behavior (OHB), and OHBI based on TPB variables [attitude (ATT), subjective norms (SN) and perceived behavioral control (PBC)]. Descriptive statistics and structural equation modeling analysis (SEM) were employed to confirm relationships and associations among study variables. A p-value of less than 0.05 and a 95% confidence interval were used to declare statistical significance. Results A total of 393 students were participated with a response rate of 97.5%. The mean age of the participants (54% females) was 18 (± 1.3) with an age range of 16 to 24. The TPB model was well fitted to the data and explained 66% of the variance in intention. ATT (β = 0.38; 95% CI, (0.21, 0.64)), SN (β = 0.33; 95% CI, (0.05, 0.83)) and PBC (β = 0.29; 95% CI, (0.13, 0.64)) were significant predictors of OHBI, where ATT was the strongest predictor of OHBI. Conclusion The TPB model explained a large variance in the intention of students to improve their OHB. All TPB variables were significantly and positively linked to stronger intent, as the theory suggests. Furthermore, these results suggest that the model could provide a framework for oral hygiene promotion interventions in the study area. Indeed, these interventions should focus on changing the attitudes of students towards OHB, creation of positive social pressure, and enabling students to control over OHB barriers.

APA, Harvard, Vancouver, ISO, and other styles

47

Talkhan, H., D. Stewart, T. McIntosh, H. Ziglam, P. V. Abdulrouf, M. Al-Hail, M. Diab, and S. Cunningham. "Using the Theoretical Domains Framework to investigate clinicians’ behavioural determinants of antimicrobial prescribing in Qatar." International Journal of Pharmacy Practice 29, Supplement_1 (March 26, 2021): i20—i22. http://dx.doi.org/10.1093/ijpp/riab015.025.

Full text

Abstract:

Abstract Introduction A recent systematic review by Talkhan et al demonstrated the need for theoretically based behaviour change interventions in this area. [1] For development of such complex interventions, emphasis should be placed on using theory to systematically identify behavioural determinants of antimicrobial prescribing. Aim To identify and quantify clinicians’ behavioural determinants of antimicrobial prescribing in Qatar. Methods This cross-sectional survey is part of a multi-phase explanatory, sequential mixed methods PhD project in Qatar. Questionnaires were distributed (online and paper based) to all doctors (~4,000) and pharmacists (~400) within Hamad Medical Corporation (HMC, the main healthcare provider). The questionnaire was developed with reference to the Determinants of Implementation Behaviour Questionnaire (a generic questionnaire derived from the 14 theoretical domains of the Theoretical Domains Framework, TDF). [2] Each item was presented as a 5-point Likert scale (scored 5=Strongly agree to 1=Strongly disagree). Personal and practice demographics were also collected for data contextualisation. The draft questionnaire was reviewed for face/content validity by an expert panel of six researchers in Qatar and the UK with experience in the use of the TDF, followed by ‘Think aloud’ testing and piloting. Analysis investigated the behavioural determinants and influential factors through descriptive, principal component analysis (PCA) and inferential analysis. Ethics approval was granted from a UK university and HMC. Results In total, 535 responses were received, 339 (63.4%) from doctors and 196 (36.6%) from pharmacists. Respondents were predominantly male, 346 (64.7%). Just over half (n=285, 53.3%) had ≤ 5 years’ experience as health professionals. PCA showed a three component (C) solution with components incorporating a number of questionnaire items labelled: ‘Guidelines compliance’ (C1 with 8 items), ‘Influences on prescribing’ (C2 with 7 items) and ‘Self-efficacy’ (C3 with 5 items) in prescribing/recommendation activity. A scale score for each respondent was calculated through summation of Likert scores for the relevant questionnaire items within each component. These scales had high internal reliability (Cronbach’s alpha all >0.7) showing consistency in response between component items indicating statistical appropriateness for developing scales. The median score (possible scale range, midpoint) for each scale was C1, 32 (8 to 40, 24), C2, 26 (7 to 35, 21) and C3, 20, (5 to 25, 15). By way of example Table 1 shows levels of agreement for items in C2. This shows lower levels of agreement than C1 scale with the median scale score (26) closer to the midpoint (21) indicating that respondents had less positive views. Inferential analysis using these scale scores and free text analysis is in progress. Conclusion A theoretical basis was used throughout providing insights to behavioural determinants for the development of a theory-based behaviour change intervention. Preliminary results suggest that social influences, staff development and quality monitoring may be useful targets for behaviour change interventions to improve antimicrobial prescribing practice. Limitations include potential social desirability bias and focus on one healthcare organisation/country in the Middle East which may limit generalisability of findings. More in-depth exploration is required to select and test appropriate linked theory-based behaviour change techniques. References 1. Talkhan H, Stewart D, McIntosh T, Ziglam H Palli Valapila, A; Moza Sulaiman H, Diab M, Cunningham S. The use of theory in the development and evaluation of behaviour change interventions to improve antimicrobial prescribing: a systematic review. J Antimicrob Chemother. 2020;75(9):2394–2410, Available from https://doi.org/10.1093/jac/dkaa154 [Accessed 12 Oct 2020]. 2. Huijg JM, Gebhardt WA, Dusseldorp E, Verheijden MW, van der Zouwe N, Middelkoop BJ, Crone MR. Measuring determinants of implementation behaviour: psychometric properties of a questionnaire based on the Theoretical Domains Framework. Implement. Sci. 2014;9(1):33.

APA, Harvard, Vancouver, ISO, and other styles

48

Reise, Steven P., Andrew T. Ainsworth, and Mark G. Haviland. "Item Response Theory." Current Directions in Psychological Science 14, no. 2 (April 2005): 95–101. http://dx.doi.org/10.1111/j.0963-7214.2005.00342.x.

Full text

Abstract:

Item response theory (IRT) is an increasingly popular approach to the development, evaluation, and administration of psychological measures. We introduce, first, three IRT fundamentals: (a) item response functions, (b) information functions, and (c) invariance. We next illustrate how IRT modeling can improve the quality of psychological measurement. Available evidence suggests that the differences between IRT and traditional psychometric methods are not trivial; IRT applications can improve the precision and validity of psychological research across a wide range of subjects.

APA, Harvard, Vancouver, ISO, and other styles

49

USAMI, SATOSHI, KOJIRO SHOJIMA, HARUHIKO MITSUNAGA, and NAOYA TODO. "Methods and Practices in Item Response Theory:." Annual Report of Educational Psychology in Japan 58 (March 30, 2019): 321–29. http://dx.doi.org/10.5926/arepj.58.321.

Full text

APA, Harvard, Vancouver, ISO, and other styles

50

Shanmugam, Ramalingam. "Handbook of item response theory: volume two, statistical tools." Journal of Statistical Computation and Simulation 90, no. 12 (June 13, 2019): 2306. http://dx.doi.org/10.1080/00949655.2019.1628907.

Full text

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!