🤖 AI Influencers will kill IT sector

🦾Plus: 🎬 YouTube brings Veo 2 to Shorts

Hey folks! Let’s get into Big Data and AI craziness…

In today's edition: What's Shaping the Future of Data?

  • 🤖500 Use Cases for Generative AI

  • 🛠️BAML is like building blocks for AI engineers

  • 📊Top Themes in Data in 2025

  • 🖥️Open sourcing kubenetmon

  • 👀 OpenAI lays out plans for GPT-5

  • 📽️ Adobe Firefly unveils first video generation model

  • 🧠 Anthropic prepares next major Claude model

  • 💡 AI Tutorial: How to access Alibaba's new Qwen 2.5 AI model

  • 🤖 AI Tools and Data Tools to check out

A data lake is a storage space where large amounts of data can be kept in their raw formats. The data may be structured, semi-structured, or unstructured. Data lakes are commonly used for data exploration, data analytics, and machine learning.
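As a rough illustration of that definition, here is a minimal sketch that treats a temporary local folder as a toy data lake. The folder layout, file names, and sample records are all made up for the example; a real data lake would normally sit on object storage rather than a local disk.

```python
import csv
import json
import tempfile
from pathlib import Path

# Hypothetical toy "lake": a temporary local folder stands in for object storage.
lake = Path(tempfile.mkdtemp(prefix="toy_lake_"))
raw = lake / "raw"
raw.mkdir()

# Structured data: rows and columns, stored as CSV.
with open(raw / "orders.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["order_id", "amount"])
    writer.writerow([1, 19.99])
    writer.writerow([2, 5.00])

# Semi-structured data: JSON records without a fixed table layout.
(raw / "events.json").write_text(json.dumps([{"user": "a", "action": "click"}]))

# Unstructured data: free text, kept exactly as it arrived.
(raw / "notes.txt").write_text("Customer called about a late delivery.")

# Exploration: list what is in the lake, then read one file for a quick total.
inventory = sorted(p.name for p in raw.iterdir())
print(inventory)

with open(raw / "orders.csv", newline="") as f:
    total = sum(float(row["amount"]) for row in csv.DictReader(f))
print(round(total, 2))
```

The point of the sketch is that nothing is converted on the way in: each file keeps its original format, and structure is applied only when the data is read.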

Ready to uncover the secrets behind how industry giants like Netflix, Airbnb, and Doordash leverage machine learning for unparalleled success? Unlock a treasure trove of insights with 500 real-world case studies from 100+ companies, delving into practical ML use cases and lessons learned in designing ML systems.

There are two opposing forces in the world of data: an overall consolidation within the modern data stack and a massive expansion driven by AI capabilities. AI is rewriting every rule about what’s possible with data in 2025. Here are Theory’s Top Themes in Data in 2025, with the full presentation at the bottom.

In this post, I’ll explain more about how BAML, a domain-specific language for helping LLMs generate better structured outputs, provides AI engineers the necessary building blocks to create more composable, testable and robust LLM and agentic workflows.
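To make "structured outputs" concrete, here is a minimal sketch in plain Python. It is not BAML syntax and does not use any BAML API; it only shows the underlying idea of checking a model's reply against a declared schema. The ToolSummary schema, the parse_tool_summary helper, and the hard-coded reply are all hypothetical.

```python
import json
from dataclasses import dataclass


@dataclass
class ToolSummary:
    """Hypothetical target schema for a model's structured reply."""
    name: str
    category: str
    supports_local_models: bool


def parse_tool_summary(raw_reply: str) -> ToolSummary:
    """Parse a JSON reply and check it against the ToolSummary schema."""
    # json.loads raises json.JSONDecodeError (a ValueError subclass) on malformed JSON.
    data = json.loads(raw_reply)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    expected = {"name": str, "category": str, "supports_local_models": bool}
    for field, field_type in expected.items():
        if field not in data:
            raise ValueError(f"missing field: {field}")
        if not isinstance(data[field], field_type):
            raise ValueError(f"wrong type for field: {field}")
    # Extra fields in the reply are ignored; only the declared ones are kept.
    return ToolSummary(**{field: data[field] for field in expected})


# A hard-coded string stands in for a real model reply.
example_reply = (
    '{"name": "Data Formulator", "category": "charting", '
    '"supports_local_models": false}'
)
summary = parse_tool_summary(example_reply)
print(summary)
```

Because the parser either returns a typed object or raises an error, it can be unit-tested without calling a model, which is the kind of testability the post attributes to working with explicit schemas.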

We faced this problem at ClickHouse Cloud too. We run on all 3 major cloud providers – AWS, GCP, and Azure. We operate in many regions for each provider, pushing the number of regions we have infrastructure in into dozens, and that’s excluding our staging and development environments. Each region is typically home to multiple Kubernetes clusters.

Tech-illiterate managers see AI-generated hype and think they need to disrupt everything: cut salaries, push impossible deadlines and replace skilled workers with AI that barely functions. Instead of making IT more efficient, they drive talent away, lower industry standards and create burnout cycles. The results? Worse products, more tech debt and a race to the bottom where nobody wins except investors cashing out before the crash.

👨‍💻 Data Tools, Libraries

ESM2 in Equinox (GitHub Repo)

ESM2 is a great protein folding model. This is a well-maintained version in a popular JAX package.

Data Formulator (GitHub Repo)

A great little tool from Microsoft that uses language models to create charts and answer questions about datasets. It unfortunately doesn't support local models, but works well with an API key.

Page Assist (GitHub Repo)

One of many new tools for web browsing with local language models.

AI News:

Elon Musk's offer to buy OpenAI for $97.4 billion has a clear deadline: May 10. The offer stipulates that the buyers may examine OpenAI's financial and business records and interview OpenAI staff before the all-cash transaction is finalized. The offer undermines Musk's legal claims that OpenAI's assets can't be transferred away for private gain.

Adobe's Firefly Video model is now in public beta, offering AI video tools trained on licensed content. The system creates 1080p video clips from text or image prompts and works with Premiere Pro. Tiered pricing plans provide different access levels, starting at $9.99 per month.

According to a new report from The Information, Anthropic is set to release a new AI model in the coming weeks. The model will reportedly combine traditional language capabilities with advanced reasoning features, excel at coding, and offer developers more control over balancing speed and compute.

YouTube announced that it is rolling out Veo 2, Google DeepMind's latest video generation model, to its Shorts platform, allowing creators to generate custom video clips and backgrounds directly from text descriptions.

OpenAI's CEO has revealed plans for GPT-4.5 and the long-awaited GPT-5. OpenAI plans to simplify its product lineup and aims for a unified intelligence experience. GPT-4.5, previously known as Orion, will be the last model of its kind before a big shift. GPT-5 is planned to integrate various technologies and to offer different intelligence levels to users on the Free, Plus, and Pro tiers.

AI Tutorial

How to access Alibaba's new Qwen 2.5 AI model

Qwen 2.5 is Alibaba’s latest AI model, which the company claims outperforms GPT-4o, DeepSeek-V3, and Llama-3.1-405B. Here’s how you can try it out:

Access Qwen 2.5 on Qwen Chat

  1. Go to chat.qwenlm.ai.

  2. Click Sign in and log in with your email or Google account.

  3. Once logged in, find the drop-down menu at the top to select different Qwen 2.5 versions.

  4. Choose the version you want to use (Qwen 2.5 Plus, Max, or other variants).

  5. Enter your prompt in the chatbox, and Qwen 2.5 will generate a response.

🔥Top AI tools to increase productivity: 

  1. Fakeface is an AI-powered online tool for creating personalized face-swap videos.

  2. BlogPro is a free tool to publish, grow, and monetize your Notion content all in one place.

  3. Effie is a light, clean, yet powerful AI writing app that works across platforms and adapts to different kinds of writing.

  4. Deblank is an intelligent design companion with AI-enhanced color and font recommendations that help you start design projects faster.

  5. ProJourney lets you use Midjourney without going through Discord.

  6. Moemate is an AI studio that lets anyone create and chat with AI characters.

View our database of all the best AI tools for your needs: aitoolsup.com

Have cool resources to share? Submit AI tool

A.I. Generated Image of the Day

👀 Happy Valentine's Day!

AI Tools Up Newsletter

Receive a weekly email with updates on new AI tools, helpful prompts, and the latest AI developments. Join over 10,000 professionals from Google, OpenAI, Notion, Apple, and more.

SPONSOR US

Get your product in front of Big Data & AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world.

Interested in sponsoring the Big Data News Weekly Newsletter? Get in touch today

What did you think of today's email?

Your feedback helps me create better emails for you!
