Description
Please write a report in 800-1,200 (12 point font, double-spaced) words and cite appropriately. You need to have references to introduce the background of your condition. In-text citations and the list of references should follow APA 7th edition style. The report should consist of 4 to 5 paragraphs: introduction, two or three paragraphs of the main body, and conclusion. The topics can include but are not limited to background, causes of the disease, treatments, status, influential factors, and a particular reason of your interest.
The purpose of the phase 1 report is 1) to select a dataset for the term project between the MEPS and FAERS datasets, 2) to decide a medical condition for which you want to conduct data analyses, and 3) to have a basic understanding of the medical condition, such as the background, causes of the disease, treatments, status, and influential factors. You also need to prepare a sample dataset by utilizing one of two Jupyter notebooks provided in week 2 and week 6. For the data analysis, you should have at least 500 data instances. Therefore, please find a medical condition of interest and then check whether there are 500 data instances. More details will be explained below.
Dataset
Medical Expenditure Panel Survey
MEPS is a set of large-scale surveys of families and individuals, their medical providers, and employers across the United States. MEPS is a complete source of data on the cost and use of health care and health insurance coverage in the US. For more details, please visit the official website (Links to an external site.) and Github repository (Links to an external site.) and read the attached appendix document Download attached appendix document(p.1 – 13).
MEPS data consists of various variables such as medical condition, socioeconomic factors (e.g., gender, region, race, and family income), and medical expenditure. MEPS data also consist of various files such as person-level (e.g., health status, demographics, and total $$ of care), event-level (e.g., healthcare service use), and condition-level (e.g., medical condition). For the full review of those variables, please look at codebooks (person-level (Links to an external site.)) and condition-level (Links to an external site.)). I also coded those variables regarding usefulness for analysis (included vs. excluded, Heejun_Inclusion field) and variable type (independent vs. dependent, Heejun_Variable_Type field). You can find my version of the codebook from this link (Links to an external site.). In particular there are some dependent variables you can utilize:
Total health expenditures
Total inpatient expenditures
Total emergency care expenditures
Severity of Illness (attacks/year)
Number of School Days Missed (Children)
Number of Work Days Missed (Adult)
You should explore the dataset in depth to understand what you can do and to decide what you will do. It is a complex dataset, and you need to merge a number of files into one for your project. Do not feel overwhelmed. I will introduce all procedures step by step.
Depending on the medical condition (e.g., allergic rhinitis), research goals you can set will include but are not limited to:
Predict the yearly medical expenditure of persons with allergic rhinitis
Compare healthcare costs in different social determinant factors (e.g., sex, region, family income, and race)
Find relationships between allergic rhinitis and environmental factors
FDA Adverse Event Reporting System (FAERS)
The FDA’s Adverse Event Reporting System (FDA FAERS) is one of the databases containing adverse drug event (ADE) reports related to drugs, biologics, and certain other medical products. The FDA received more than 1.8 million new ADE reports in 2017, and the total number of reports from FAERS exceeds 15.9 million. The FDA FAERS data is publicly available and free to all users. This is a very rich resource for researchers working on pharmacovigilance. For more details, please visit the official website (Links to an external site.) and read the attached readme file Download readme file. Some available variables are:
Suspect product(s)
Concomitant product(s)
Name(s) of the AE(s)
Seriousness assessment of the AE (ie, serious or nonserious)
Patient age and sex
Country of incidence of AE
This is also a complex dataset. You should explore the dataset in depth to understand what you can do and to decide what you will do. Based on the medical condition (e.g., allergic rhinitis) you selected, research goal will be to discover adverse drug reactions.
What to Do for the Phase 1 Report
Explore the MEPS and FAERS datasets by utilizing two Jupyter notebooks (merging_FAERS.ipynb and week6_data_processing_MEPS.zip).
Decide a medical condition of your group’s interest.
You can use the same medical condition from the previous semester or find a new medical condition.
Select a dataset for the term project between the MEPS and FAERS datasets. (Links to an external site.)
By utilizing one of the notebooks in step 1, prepare a sample data consisting of at least 500 data instances.
The notebook you used to create the sample data should be submitted.
The sample data also need to be uploaded.
Write a report
You can utilize your previous phase 1 report from the previous course but should rewrite most of the parts. A similarity score from TurnItIn up to 40% will be allowed excluding references and quotations.
Please consider the comments I left to improve your report. If you don’t have it, you can request it by email.
Writing it again will help you improve your writing skill.
Please note that you can use the same condition while using a new dataset (i.e., FAERS dataset)
How to write
Please write a report in 800-1,200 (12 point font, double-spaced) words and cite appropriately. You need to have references to introduce the background of your condition. In-text citations and the list of references should follow APA 7th edition style. The report should consist of 4 to 5 paragraphs: introduction, two or three paragraphs of the main body, and conclusion. The topics can include but are not limited to background, causes of the disease, treatments, status, influential factors, and a particular reason of your interest.
What to include
Your submission of the report should include:
Names of students in your project group (up to two students)
EUIDs of group members
Title of the project
Medical condition
Data source
Number of data records
What I asked you in the “How to Write”
The order of contributions (i.e., work distribution) like authorship (e.g., first author, second author, third author, and so on.)
If you believe that some of you or all of you contributed equally, then you need to state it
Here, contributions include all phases you worked to submit this project report
Needs help with similar assignment?
We are available 24x7 to deliver the best services and assignment ready within 6-12hours? Order a custom-written, plagiarism-free paper
Get Answer Over WhatsApp Order Paper Now