UMUC Data 620 Assignment 12.1 Given Unstructured text data, Each Student Processes the Text Data and Creates Valuable Business Analytics

 

Are you in stress with your UMUC Data 620 Assignment 12.1 Homework? Our Experts are here for your assistance with your Python Assignment Writing Services at your budget price from Assignmenthelpaus.com. We offer the Coursework help with zero plagiarism work. Book your order now!

 

Text Analysis with Python

 

Given unstructured text data, each student processes the text data and creates valuable business analytics, similar to those in the Elder Research and Jiffy Lube case studies. Each student creates a managerial report identifying the patterns and recommendations uncovered in the data.

 

Complete instructions may be found below.

 

Turn in separate files with the following items.  Please label each with the TURN IN # statement so we can easily follow it.

  1. Read the Elder Research and Unilever case studies and note the types of information they were able to extract from the text data. (We don’t quite have that amount of information available, or quite that level of software, so your conclusions won’t be quite as detailed.)
  2. Elder Research Inc. (2013). Improving customer retention and profitability for a regional provider of wireless services. Retrieved from https://cdn2.hubspot.net/hubfs/2176909/Resources/Elder-Research-Case-Study-Customer-Retention-nTelos.pdf .
  3. Jiffy Lube Uses Odin Text Software to Increase Revenue.https://greenbookblog.org/2018/01/12/shell-oil-identifying-key-revenue-drivers-in-customer-comment-data/There’s also an interesting interview video here: https://www.youtube.com/embed/2Zxmjir8Zwo?autoplay=1
  4. These case studies are for inspiration only; there is nothing to turn in from them.
  5. Choose a readily available text item which recurred over at least three time periods, spaced some distance apart. You want to make sure you have at least several thousand words of text (more than 10 pages) for each time period, and you want something which will change noticeably over that time period.  Some options could include:
    • The CEO’s letter to shareholders (I do Amazon.com below as an example – you may choose any other company) in three different years; you may want to supplement with additional information from the company if this isn’t long enough.
    • The State of the Union Address (or other political speeches) from three different Presidents, such as those from Woodrow Wilson, Lyndon B. Johnson, and Barack Obama
    • The State of the State Addresses from any one of our 50 states
    • A write up of something technical (like descriptions of Motor Trend’s Car of the Year and Finalists) from 1950, 1980, and 2010.
    • Some industry write-up (such as PC Magazine’s best new computers for 1979, 1989, and 1999).
    • Some policy over time (such as Google’s Privacy Policies)
    • Recommendations about what to eat (such as the government’s Nutrition Guidelines. It could be interesting to cross-index the low-fat recommendations from 1980 with today’s ketogenic or Paleo diets.)
  6. Match your timeline to the subject. For political speeches, you will get the best results if they are at least 50 years apart.  For faster-changing items (such as the cellular phone user’s manual), you can probably get away with things 10 years apart.  You will have a much easier time making good graphs if you give yourself good raw data to work with.
  7. Select your desired number of time periods. You must have at least three, and can use as many as you like.  (i.e. 2000, 2005, and 2010 would be three time periods.)
  8. Take your text items for each year and convert them into a text input file.
  9. Run a Python program to determine the top X word count for each year. You will need to determine how many words you are going to use in your analysis; you should probably have somewhere between 5 and 30.
    • You will need to make decisions about stop words.
    • Make sure the bulk of your text processing is done in Python, not using the “search/replace” functions in Excel or Word. Part of this class’ skillset is exposure to Python, and this is how you should do it here.
  10. Merge your Python word count data into an input file for Tableau. You may find it helpful to use Excel or some other tool for this.
  11. Analyze your data. Emphasis here will be placed on visual analysis and text analysis.
  12. As part of your analysis, take the top 3 interesting relevant words from your latest time period. (You can use some judgment here; for Amazon, “kindle” would be more interesting than “book” even if “book” had more occurrences.)  Trace the trajectory of each of these 3 words over time – for example, at Amazon, the word “kindle” gains tremendously in popularity over time.
  13. Create a managerial report in Word outlining your findings.

 

Use what you already know about visualization.  Additionally, some Tableau graphs you may find helpful for this sort of analysis include:

  • Bump charts – to show changes in rank of items over timehttp://www.tableau.com/learn/tutorials/on-demand/bump-charts
  • Packed Bubble Charts – http://onlinehelp.tableau.com/current/pro/desktop/en-us/help.htm#buildexamples_bubbles.html

 

Australian Expert Assignment Help

 

Reference ID: #getanswers2001168