Skip to content

Releases: microsoft/SynapseML

SynapseML v1.1.0

20 Nov 03:51

Choose a tag to compare


SynapseML: Simple and distributed machine learning

We are excited to announce the release of SynapseML v1.1 marking a host of powerful new features introduced since the initial v1.0 release. SynapseML is an open-source library that aims to streamline the development of massively scalable machine learning pipelines. It unifies several existing ML Frameworks and new Microsoft algorithms in a single, scalable API that is usable across Python, R, Scala, and Java. SynapseML is usable from any Apache Spark platform with first class enterprise support on Microsoft Fabric.

Highlights

Microsoft Fabric AI Functions OneLake
Build and operationalize distributed ML with SynapseML in Fabric Apply Pandas and Spark LLM transformations with one line of code Automatically derive AI insights for unstructured data in OneLake
Build Your First Model Explore AI Functions Learn More
Hugging Face Azure AI Foundry
Use open source models hosted on Hugging Face Run Azure AI Foundry models in your notebook
Try an Example View Notebook

More Hightlights

Spark 3.5 Support – In this version we transitioned to Spark 3.5 as our main Spark platform.

OpenAI Ecosystem – Comprehensive improvements including global parameter defaults, GPT-4 enablement, custom endpoints/headers, GPU-accelerated embeddings with KNN, and fine-grained control over model parameters (top_p, seed, responseFormat, temperature).

ML Innovation – HuggingFaceCausalLM transformer for distributed language model evaluation, custom embedder support, and synthetic difference-in-differences causal inference module.

Platform features – Spark Native OneLake support; MSI for Azure Storage; OpenAITranslate transformer.

AI Functions in Data Wrangler on Fabric – AI Functions built into Data Wrangler in Fabric allow you to apply LLM-powered operations to your dataframe without writing a single line of code.

New Features

Documentation πŸ“š

AI Functions ⚑

Azure OpenAI 🌸

  • Enhanced Model Parameters – Added top_p, seed, responseFormat, temperature, and subscription key support (#2410, #2329, #2324)
  • GPT-4 Enablement – Full GPT-4 support in OpenAIPrompt (#2248)
  • Custom Endpoints & Headers – Support for custom URL endpoints and HTTP headers (#2232)
  • GPU-Accelerated Embeddings – OpenAI embeddings with GPU-based KNN pipeline (#2157)
  • Embedding Dimensions Control – Configurable dimensions parameter for OpenAIEmbedding (#2215)
  • Global Parameter Defaults – Centralized OpenAI parameter management with Python wrapper support (#2318, #2327)
  • Updated OpenAI API version to 2024 (#2190)
  • Updated OpenAIDefaults implementation (#2415)
  • OpenAIPrompt bug fixes and improvements (#2334)
  • Added responseFormat parameter to Chat Completion (#2329)
  • Optimized getOptionalParams in HasOpenAITextParams (#2315)

OneLake 🌊

  • Add Spark Native OneLake support (#1190687)

Machine Learning πŸ•ΈοΈ

  • HuggingFaceCausalLM – Transformer for evaluating language models on Spark clusters (#2301)
  • Custom Embedder – Extensible custom embedding transformer support (#2236)
  • Synthetic DiD – Synthetic difference-in-differences module for causal inference (#2095)

Azure AI Foundry πŸ”¨

  • AIFoundryChatCompletion – New transformer for Azure AI Foundry chat models (#2398)
  • AI Foundry + OpenAI Prompt – Unified interface for OpenAI and Foundry deployments (#2404)

General ✨

  • Add Spark 3.5 Support – Added full Spark 3.5 compatibility with new build variants (#2052)
  • Python 3.11 Baseline – Upgraded to Python 3.11 as minimum version (#2193)
  • Fabric Billing Integration – Enhanced Fabric Cognitive Service token for billing support (#2291)
  • Fabric WSPL FQDN Selection – Configurable Fabric workspace FQDN endpoints (#2376)
  • Added Bool input support for ONNX models ([#2130...
Read more

SynapseML v1.0.14

16 Sep 06:45
123ead2

Choose a tag to compare

Changes:

See More

This list of changes was auto generated.

SynapseML v1.0.14 Spark 3.5

16 Sep 06:47

Choose a tag to compare

Pre-release
v1.0.14-spark3.5

Enabling Synapse 3.5 tests

SynapseML v1.0.13

04 Aug 19:05
a394061

Choose a tag to compare

Changes:

This list of changes was auto generated.

SynapseML v1.0.13-spark3.5

04 Aug 19:08

Choose a tag to compare

Pre-release
Enabling Synapse 3.5 tests

v1.0.12-spark3.5

07 Jul 16:07

Choose a tag to compare

v1.0.12-spark3.5 Pre-release
Pre-release
Enabling Synapse 3.5 tests

SynapseML v1.0.12

03 Jul 20:33
9c91148

Choose a tag to compare

Changes:

See More
  • 9af855e fix: support scoring profiles in Azure Search index parsing (#2383)
  • 326988d chore: update sbt version to allow for amd local builds (#2384)
  • 3865e71 fix: fix model checking logic (#2379)
  • 1eec70d fix: fix bug where token cannot be acquired on system context (#2378)
  • 141039b fix: add hf causal LM python tests, fix build (#2374)
  • 6c95bf0 fix: add case for Python only envs (#2368)
  • c1cef65 chore: limit adb concurrency (#2370)

This list of changes was auto generated.

SynapseML v1.0.11

15 May 16:56
58b945f

Choose a tag to compare

Changes:

See More

This list of changes was auto generated.

SynapseML v1.0.11-spark3.5

17 Apr 03:06

Choose a tag to compare

Pre-release
chore: Adding Spark35 support

v1.0.10-spark3.5

24 Feb 04:08

Choose a tag to compare

v1.0.10-spark3.5 Pre-release
Pre-release
chore: Adding Spark35 support