ISSUE 455 · October 3, 2023Posts & TutorialsData-driven news discourse analysis with PythonIn this two-part tutorial, Karlis Kanders shows how to track and analyze news and public discussions about specified topics. Part 1 covers data access and how to perform simple topic analysis. Part 2 explores ways to analyze the discourse in finer detail. This is easy to follow and the techniques could be applied to any text data and topic of your interest. 24 hours in an invisible epidemicPart story-telling and part quantified-self, this post is an awesome look at the data gathered by the American Time Use Survey. The survey asks tens of thousands of Americans to track one day of their lives and it provides a lot of detail about what those days look like. Also, see the followup, "Behind the scenes with 24 hours in an invisible epidemic." The Most Unexpectedly Good and Bad TV EpisodesNice tutorial showing how to access and analyze IMDB data using R. There isn't anything fancy here but it's very well done and could be easily extended to explore other aspects of the IMDB data. Includes lots of code and screenshots along the way. Sponsored LinkWriting 100s of lines of code to gather external API data?Stop building and managing API integrations and data pipelines and return to shipping features. Access any data source, store, or ML model from a single endpoint — SQL, GraphQL, or REST. How often does Roy Kent say "F*CK"?Great post (and screencast!), showing how to use Poisson regression and bootstrap resampling to find confidence intervals for when Roy Kent (from Ted Lasso) drops F-Bombs. The screencast is a step by step code-through while Julia talks about her decisions along the way. This is a fun post and the ideas here are particularly useful with small datasets. SQLite insert speedups, from the useful to the absurdHow fast can you insert data into SQLite? Faster than you probably think. This post explores ways to get the best insert performance out of SQLite generally; what's in your control, and what isn't. There's a linked index at the top or skip to the end for the most important tips. Fine, I'll run a regression analysisOf course, data should win an argument, right? Well, not on the Internet. In this post, Nate Silver describes his approach for making arguments with data and how sometimes he ends up more in the weeds than he'd like. Ultimately, he shows here that COVID vaccination rates are strongly predictive of COVID death rates. Build an API in the cloud in less than 200 lines of codeCloud tools and Python packages have become so powerful that you can build a scalable, cloud-based API in less than 200 lines of code. In this post, you’ll see how to use Google Cloud, Terraform, and FastAPI to deploy a queryable data API on the cloud. Webinar: Hot takes and heartbreaks from business and data leadersJoin Benn Stancil, Field Chief Technology Officer of ThoughtSpot and co-founder of Mode on October 10 at 9 a.m. PT for a live panel with leaders from Matillion and Jasper.ai. They will get candid about the common misconceptions and stereotypes data leaders face and how to collaborate more effectively with your business leaders. RSVP here. Tools & CodeVizroVizro is an open-source Python toolkit that's designed to be a simple, low-code way to create scalable visualization applications. It's low-code, but even so, it offers a lot of flexibility for advanced solutions. OutlierWiFi without internet on a Southwest flightGreat discussion about WiFi, hacking, and how to access data that you probably shouldn't. The post by James Vaughan that kicked off the discussion is also worthwile. Was this email forwarded to you? Sign up here >> |
Поиск по этому блогу
Search1
123
вторник, 3 октября 2023 г.
Data Elixir - Issue 455
Подписаться на:
Комментарии к сообщению (Atom)
Комментариев нет:
Отправить комментарий
Примечание. Отправлять комментарии могут только участники этого блога.