Close Menu
TheKhaleejPost
    What's Hot
    Business

    Faraday Future Launches FX Super One in the UAE

    Business

    QRDI council concludes 10th edition of SME Program, empowering Qatar’s innovators

    Culture

    Petros Nearchou Embarks on Mount Kilimanjaro Climb for the Nearchou Foundation’s First Charitable Mission

    Important Pages:
    • Home
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    TheKhaleejPost
    • Home
    • Technology

      Faraday Future Exhibits at WETEX 2025 and Announces FX SuperOne Final Launch on October 28 in Dubai

      FootAR Showcased Augmented Reality at the SBC Summit Lisbon 2025

      Snap launches App Power Pack to support advertisers in Saudi Arabia

      Future-focused: Swiss International Scientific School in Dubai expands IB Career-related Programme

      ArabyAds Talent introduces the next evolution of celebrity management and representation for MENA’s public figures

    • Business

      Dubai Shopping Festival Reimagines the City as a Living Canvas Through Art, Light and Design

      Faraday Future Announces the FX Super One Roadmap for Mass Production, Sales, Delivery, Service and Ramp-Up and Its Entry into Embodied AI Robotics

      RTA Completes 60% of Oud Maitha Road and Al Asayel Street Development Project

      Pure Gold Jewellers Celebrates the UAE’s Year of the Family with a Timeless Gift of Love, Peace and Harmony

      Emirates Post and Shabab Al Ahli commemorate Al Fursan’s historic quadruple with stamp collection

    • Submit A Press Release
    TheKhaleejPost
    Home » Snowflake Teams Up with Meta to Host and Optimize New Flagship Model Family in Snowflake Cortex AI
    Business

    Snowflake Teams Up with Meta to Host and Optimize New Flagship Model Family in Snowflake Cortex AI

    Facebook Twitter Pinterest WhatsApp
    Share
    Facebook Twitter Pinterest WhatsApp

    July, 2024 – Snowflake (NYSE: SNOW), the AI Data Cloud company, today announced that it will host the Llama 3.1 collection of multilingual open source large language models (LLMs) in Snowflake Cortex AI for enterprises to easily harness and build powerful AI applications at scale. This offering includes Meta’s largest and most powerful open source LLM, Llama 3.1 405B, with Snowflake developing and open sourcing the inference system stack to enable real-time, high-throughput inference and further democratize powerful natural language processing and generation applications. Snowflake’s industry-leading AI Research Team has optimized Llama 3.1 405B for both inference and fine-tuning, supporting a massive 128K context window from day one, while enabling real-time inference with up to 3x lower end-to-end latency and 1.4x higher throughput than existing open source solutions. Moreover, it allows for fine-tuning on the massive model using just a single GPU node — eliminating costs and complexity for developers and users — all within Cortex AI.

    By partnering with Meta, Snowflake is providing customers with easy, efficient, and trusted ways to seamlessly access, fine-tune, and deploy Meta’s newest models in the AI Data Cloud, with a comprehensive approach to trust and safety built-in at the foundational level.

    “Snowflake’s world-class AI Research Team is blazing a trail for how enterprises and the open source community can harness state-of-the-art open models like Llama 3.1 405B for inference and fine-tuning in a way that maximizes efficiency,” said Vivek Raghunathan, VP of AI Engineering, Snowflake. “We’re not just bringing Meta’s cutting-edge models directly to our customers through Snowflake Cortex AI. We’re arming enterprises and the AI community with new research and open source code that supports 128K context windows, multi-node inference, pipeline parallelism, 8-bit floating point quantization, and more to advance AI for the broader ecosystem.”

    Snowflake’s Industry-Leading AI Research Team Unlocks the Fastest, Most Memory Efficient Open Source Inference and Fine-Tuning
    Snowflake’s AI Research Team continues to push the boundaries of open source innovations through its regular contributions to the AI community and transparency around how it is building cutting-edge LLM technologies. In tandem with the launch of Llama 3.1 405B, Snowflake’s AI Research Team is now open sourcing its Massive LLM Inference and Fine-Tuning System Optimization Stack in collaboration with DeepSpeed, Hugging Face, vLLM, and the broader AI community. This breakthrough establishes a new state-of-the-art for open source inference and fine-tuning systems for multi-hundred billion parameter models.

    Massive model scale and memory requirements pose significant challenges for users aiming to achieve low-latency inference for real-time use cases, high throughput for cost effectiveness, and long context support for various enterprise-grade generative AI use cases. The memory requirements of storing model and activation states also make fine-tuning extremely challenging, with the large GPU clusters required to fit the model states for training often inaccessible to data scientists.

    Snowflake’s Massive LLM Inference and Fine-Tuning System Optimization Stack addresses these challenges. By using advanced parallelism techniques and memory optimizations, Snowflake enables fast and efficient AI processing, without needing complex and expensive infrastructure. For Llama 3.1 405B, Snowflake’s system stack delivers real-time, high-throughput performance on just a single GPU node and supports a massive 128k context windows across multi-node setups. This flexibility extends to both next-generation and legacy hardware, making it accessible to a broader range of businesses. Moreover, data scientists can fine-tune Llama 3.1 405B using mixed precision techniques on fewer GPUs, eliminating the need for large GPU clusters. As a result, organizations can adapt and deploy powerful enterprise-grade generative AI applications easily, efficiently, and safely.

    Snowflake’s AI Research Team has also developed optimized infrastructure for fine-tuning inclusive of model distillation, safety guardrails, retrieval augmented generation (RAG), and synthetic data generation so that enterprises can easily get started with these use cases within Cortex AI.

    Snowflake Cortex AI Furthers Commitment to Delivering Trustworthy, Responsible AI
    AI safety is of the utmost importance to Snowflake and its customers. As a result, Snowflake is making Snowflake Cortex Guard generally available to further safeguard against harmful content for any LLM application or asset built in Cortex AI — either using Meta’s latest models, or the LLMs available from other leading providers including AI21 Labs, Google, Mistral AI, Reka, and Snowflake itself. Cortex Guard leverages Meta’s Llama Guard 2, further unlocking trusted AI for enterprises so they can ensure that the models they’re using are safe.

    Comments on the News from Snowflake Customers and Partners
    “As a leader in the hospitality industry, we rely on generative AI to deeply understand and quantify key topics within our Voice of the Customer platform. Gaining access to Meta’s industry-leading Llama models within Snowflake Cortex AI empowers us to further talk to our data, and glean the necessary insights we need to move the needle for our business,” said Dave Lindley, Sr. Director of Data Products, E15 Group. “We’re looking forward to fine-tuning and testing Llama to drive real-time action in our operations based on live guest feedback.”

    “Safety and trust are a business imperative when it comes to harnessing generative AI, and Snowflake provides us with the assurances we need to innovate and leverage industry-leading large language models at scale,” said Ryan Klapper, an AI leader at Hakkoda. “The powerful combination of Meta’s Llama models within Snowflake Cortex AI unlocks even more opportunities for us to service internal RAG-based applications. These applications empower our stakeholders to interact seamlessly with comprehensive internal knowledge bases, ensuring they have access to accurate and relevant information whenever needed.”

    “By harnessing Meta’s Llama models within Snowflake Cortex AI, we’re giving our customers access to the latest open source LLMs,” said Matthew Scullion, Matillion CEO and co-founder. “The upcoming addition of Llama 3.1 gives our team and users even more choice and flexibility to access the large language models that suit use cases best, and stay on the cutting-edge of AI innovation. Llama 3.1 within Snowflake Cortex AI will be immediately available with Matillion on Snowflake’s launch day.”

    “As a leader in the customer engagement and customer data platform space, Twilio’s customers need access to the right data to create the right message for the right audience at the right time,” said Kevin Niparko VP, Product and Technology Strategy, Twilio Segment. “The ability to choose the right model for their use case within Snowflake Cortex AI empowers our joint customers to generate AI-driven, intelligent insights and easily activate them in downstream tools. In an era of rapid evolution, businesses need to iterate quickly on unified data sets to drive the best outcomes.”

    Share. Facebook Twitter Pinterest WhatsApp
    Previous ArticleCinnamon Hotels and Resorts unveils the unforgettable Maldivian experience through the chain’s first-ever best rate guaranteed promise
    Next Article Over 34,000 new companies become members of Dubai Chamber of Commerce during H1 2024,representing YoY growth of 5%

    Related Posts

    Business

    Dubai Shopping Festival Reimagines the City as a Living Canvas Through Art, Light and Design

    Business

    Faraday Future Announces the FX Super One Roadmap for Mass Production, Sales, Delivery, Service and Ramp-Up and Its Entry into Embodied AI Robotics

    Business

    RTA Completes 60% of Oud Maitha Road and Al Asayel Street Development Project

    Business

    Pure Gold Jewellers Celebrates the UAE’s Year of the Family with a Timeless Gift of Love, Peace and Harmony

    Business

    Emirates Post and Shabab Al Ahli commemorate Al Fursan’s historic quadruple with stamp collection

    Business

    Emirates flags busy week of travel from Jan 2 – Jan 5

    Business

    Max Fashion Brings Seasonal Style to the Forefront With its Winter Sale

    Business

    Over 2.8 Million Passengers Used Public Transport on New Year’s Eve 2026

    Stay In Touch
    • Facebook
    • Twitter
    Top Picks
    Business

    Emirates Post and Shabab Al Ahli commemorate Al Fursan’s historic quadruple with stamp collection

    Dubai, UAE, December 31, 2025: Emirates Post, in collaboration with Shabab Al Ahli Club, released a special commemorative stamp collection titled ‘Shabab Al Ahli Club – Quadruple Champions 2024/2025,’ celebrating the club’s historic achievement of winning four major titles in a single, remarkable season. This landmark success has established the ‘Fursan Al Humur’ (Red Knights) as one of the most successful teams in the UAE’s football history. The stamp launch was celebrated at Rashid Stadium, the Red Knights’ home ground, in the presence of Khalifa Saeed Sulaiman, Vice Chairman of Shabab Al Ahli Club; H.E. Major General Talal Al Shanqit

    Ministry of Culture Announces recipients of the Fourth National Grant Program for Culture and Creativity

    NEOM remains top destination for Saudi and GCC HNWI homebuyers, says Knight Frank

    FiLLi Cafe taps COFE Cloud to power global expansion through specialized digital solutions

    Hospinorm Brings Etiometry’s FDA-Cleared AITechnology to Transform Critical Care in the UAE

    Stay ahead with TheKhaleejPost, your premier destination for breaking news, analysis, and in-depth coverage across the Gulf region and beyond. Discover diverse viewpoints, stay informed, and delve into the stories shaping our communities and world.

    Facebook X (Twitter)
    Categories
    • Business (589)
      • Career (1)
      • Management (5)
      • Startup (12)
    • Business & Tech (1)
    • Culture (77)
    • Entertainment (31)
      • Celebrity (3)
      • Movies (2)
      • TV Shows (1)
    • News (75)
    • Politics (12)
    • Technology (67)
      • Phones (2)
      • Science (2)
    • Videos (10)
    Our Picks
    Business

    Dubai Police Warn of Online Scams Involving Fake Tickets for Concerts, Events and Travel

    Business

    Presight to showcase ‘Applied Intelligence Today’ across GITEX GLOBAL and Expand North Star 2025

    Business

    KORA Properties redefines waterfront luxury with IL VENTO tower in Dubai Maritime City

    © 2026 TheKhaleejPost.
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.