BDNW Issue - 83

Free Datasets for Data Science, Machine learning, AI, NLP 👨‍💻

#83 - Dec 6, 2023  News, articles on Big Data, AI, Data Science, ML, Cloud, IoT.

In today's edition:

  • ⚡️Dragonfly: a Redis replacement for heavy data workloads

  • 💻ML system design: 300 case studies to learn from

  • 📊Simplify Salesforce Report Snapshots

  • 🚀Snowflake for Beginners

  • 🧠Alibaba’s new AI tool turns images into realistic videos

  • 🛒 Mastercard Introduces AI Shopping Assistant

  • 👓 Seeing AI by Microsoft: Now on Android!

  • 🧠 DeepMind creates AI that can learn from humans

  • 🤯 A.I. Tools and News 

  • 🖼️ A.I. Generated Image of the Day

Introduction to Snowflake for Beginners

Snowflake is evolving to provide transactional and analytical capabilities in a single platform. Learn more about its capabilities.

An In-Depth Exploration of REST, gRPC, and GraphQL in Web Projects

Choosing an API plays a pivotal role in determining the success and efficiency of a web dev project. Explore 3 prominent contenders: REST, gRPC, and GraphQL.

Simplify Salesforce Report Snapshots: The Easiest Way To Track Salesforce History

Salesforce, the world’s most popular CRM, offers several built-in tools to help organizations track and analyze their data. One such tool is Salesforce Report Snapshots, which allows users to capture tabular data at specific points in time.

Introducing a Redis replacement for heavy data workloads



Need speed and flexibility in scaling dev teams? Revelo is the largest platform to hire world-class remote developers from LatAm. Get matched with vetted candidates in 3 days and receive a $2,500 credit on first hire. Start with a risk-free trial!

 Try Now!

Process Hundreds of GB of Data in the Cloud with Polars

Local machines can struggle to process large datasets due to memory and network limitations. Coiled Functions provide a cloud-based solution that allows for efficient and cost-effective handling of such extensive datasets, overcoming the constraints of local hardware for complex data processing tasks.

List of Web Components for Building an Analytics Dashboard

Dashboards are an effective way to visualize data thanks to their clarity, conciseness, and user-friendly interfaces. They are widely used to track key performance indicators (KPIs), analyze trends, and help teams gain insights and make informed decisions.
 

Free Datasets for Data Science, Machine Learning, AI, NLP

Presented below are datasets spanning a wide spectrum, catering to domains such as Data Science, Machine Learning, AI, NLP, Data Analysis, Analytics, Education, Computer Vision, Pricing Optimization, Classification, and Pre-Trained Models.

Critique: Google’s Code Review Tool



Google's internal code review tool Critique is highly rated among software engineers. This article looks at what makes Critique so good and explains how it pairs with Google's process of code review. It covers Google's guidelines for efficient code review, internal statistics on Google code reviews, and more. While Critique will never be open-sourced, Google maintains a similar open-source code review tool called Gerrit.

ML system design: 300 case studies to learn from



This site contains a database of 300 case studies from over 80 companies that share practical machine learning use cases and learnings from designing ML systems. The database can be filtered by industry or ML use case. Tags based on recurring themes are available.


🤖 AI News:

🧠DeepMind creates AI that can learn from humans

Researchers from Google DeepMind just developed a new way for AI agents to acquire knowledge from human demonstrations in real-time — allowing for "cultural transmission" without needing large datasets.

Alibaba’s new AI tool turns images into realistic videos



After Pika announced its lifelike text-to-video generator last week, Chinese tech giant Alibaba delivered another surprise by announcing Animate Anyone, a new AI tool that turns still images into videos.

Meta's chief scientist on AGI

Yann LeCun, Meta's chief scientist and a prominent figure in deep learning, believes that current AI systems, including those capable of processing large amounts of text, are decades away from achieving true sentience or human-like common sense.

Mastercard launches Shopping Muse, an AI-powered shopping assistant

Mastercard has unveiled "Shopping Muse," a novel generative AI shopping tool developed in collaboration with Dynamic Yield, a company it acquired in April 2022. The tool aims to revolutionize the digital shopping experience by providing personalized product recommendations based on users' colloquial language and understanding of modern trends.

Elon Musk’s AI startup — X.AI — files to raise $1 billion in fresh capital

👓Seeing AI by Microsoft: Now on Android!


🤖 AI Ethics: 

European SMEs Raise Issue With Potential AI Act Changes

European SMEs are voicing concerns over proposed AI Act amendments that could skew the competitive landscape. The crux of the issue: a proposal by France, Germany, and Italy suggesting that Big Tech should self-regulate its foundation models. SMEs fear this could offload regulatory burdens onto them, raising barriers to entry and stifling innovation.

 AI tools to supercharge your productivity: 

  1. 🧑‍💻 Amazon CodeWhisperer is an innovative code generator powered by machine learning, designed to assist developers by offering real-time code recommendations.

  2. Vellum is the development platform for building LLM apps with tools for prompt engineering, semantic search, version control, testing, and monitoring.

  3. Lynn is the AI support partner that understands the complexities of modern customer support.*

  4. 🎙️ HearTheWeb - Convert Newsletters into Podcasts.

  5. MyMap generates mindmaps with ease

  6. MagicDash is your quick insight AI assistant

  7. Digger: Open source infrastructure as code management tool

  8. Marketing GPTs: Save time on daily marketing tasks

  9. Morph 1.0: AI-powered BI dashboard across your SaaS data

 View our database of all the best AI tools for your needs:

Have cool resources to share? Submit a tool or reach us by replying to this email. 

 

 Data Tools, Libraries

LLM Visualization 
This site looks at how the nano-gpt model, which has 85,000 parameters, sorts a sequence of six letters into alphabetical order.

Loco (GitHub Repo)
Loco is a Rust API and web framework for full stack product builders that is strongly inspired by Rails.

MLX (GitHub Repo)
MLX is an array framework for machine learning on Apple silicon with familiar APIs, composable function transformations, lazy computation, dynamic graph construction, and more.
 


Recommended Reading

The Average Joe
The IKEA instructions for investing to help you to become a better investor. Market trends & insights that are simple, concise, and impactful.

Peak Performance
For business professionals and entrepreneurs who are interested in learning about starting, growing, and scaling a business. Get free & practical solutions to unlock your full potential in work and life.

Design Hacks
Join 50,000+ people learning UX/UI design in short, practical lessons. Original illustrated tutorials to help you design better websites and apps.

A.I. Generated Image of the Day

 

Superhero Fantasy Cover Art featuring(source)

Want to reach our audience / fellow readers? Consider Sponsoring - grab a spot now.

Big Data | Hadoop News | AI | ML | NoSQL | Education | IoT | Cloud
 

Tips? Suggestions? Feedback? Email BDAN

Curated by @BDAnalyticsnews