ISSUE 437 · May 23, 2023TrendsHachette v. Internet Archive and the Future of Data AccessA recent ruling determined that the Internet Archive's lending program violates copyright laws. This post breaks down the issues and explores how the ruling sets a new precedent in the ongoing debate about copyright law, fair use, and preservation of knowledge in the digital age. Sponsored LinkLearning data science through social mediaWhen Parmida Beigi isn’t busy pursuing speech recognition and NLP initiatives as a senior research scientist at Amazon, she can be found on social media using her skills and lifetime worth of experience to help others grow into machine learning career paths. Tutorials & OpinionsTackling Climate Change with Machine LearningThis site from the recent ICLR 2023 Workshop includes links to papers, tutorials and proposals that explore ways that machine learning can be used to tackle climate change. There's a lot here and if you're interested in this space, it's worth getting on their mailing list. Understanding database Indexes in PostgreSQLThis post will help you understand how database indexes work and how to use them effectively. It's written with PostgreSQL in mind but it starts with the basics and can be helpful for understanding other databases too. A Comprehensive Guide to Vector DatabasesVector databases are a new wave of data management designed for time-series applications, generative AI, and IoT. Here’s why they matter, what makes them different, how they work, the new use cases they’re designed for, and how to get started. Sharpen your math, CS and data skills in 15 minutes a dayFor professionals and lifelong learners alike, Brilliant is one of the best ways to learn. The deets: Bite-sized interactive lessons make it easy to level up in everything from math and data science to AI and beyond. Join 10+ million people building skills every day. Start your 30-day free trial today! Tools & CodeDeepchecksDeepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort. This includes checks related to various types of issues, such as model performance, data integrity, distribution mismatches, and more. ResourcesMLOps guideAwesome collection of MLOps resources, from introductory to advanced. Includes posts, tutorials, lectures, book excerpts, case studies, key repos and more. Data VisualizationIntroduction to Data Visualization for the WebThis selection of posts, books, and videos is a great introduction to doing data visualization for the web. The selections here are based on a course at the University of Washington and is essentially, a collection of must-reads for anyone interested in this space. ggblend: Blending & compositing algebra for ggplot2ggblend is an algebra of operations for blending, copying, adjusting, and compositing layers in ggplot2. It allows you to easily copy and adjust the aesthetics or parameters of an existing layer, to partition a layer into multiple pieces for re-composition, and to combine layers using blend modes, like "multiply", "overlay", etc. OutlierWord SaladA chatbot pretending to be knowledgeable is today's version of Adriano Celentano's hit song, "Prisencolinensinainciusol". That chart-busting song from 1972 used made-up words to sound like English but was completely nonsensical. Likewise, chatbots have learned to sound good. But you need to pay attention to be sure they actually make sense. Was this email forwarded to you? Sign up here >> |
Поиск по этому блогу
Search1
123
вторник, 23 мая 2023 г.
Data Elixir - Issue 437
Подписаться на:
Комментарии к сообщению (Atom)
Комментариев нет:
Отправить комментарий
Примечание. Отправлять комментарии могут только участники этого блога.