MyGit

v0.9.4

microsoft/SynapseML

版本发布时间: 2021-11-16 13:19:37

microsoft/SynapseML最新发布版本:v1.0.4-spark3.5(2024-04-11 03:16:17)

SynapseML
Building production ready distributed machine learning pipelines can be a challenge for even the most seasoned researcher or engineer. We are excited to announce the release of SynapseML (Previously MMLSpark), an open-source library that aims to simplify the creation of massively scalable machine learning pipelines. SynapseML unifies several existing ML Frameworks and new MSFT algorithms in a single, scalable API that’s usable across Python, R, Scala, and Java.

Highlights

General Availability on Synapse ONNX on Spark Responsible AI Form Recognition and Translation Reinforcement Learning
We are ready to help you productionalize on Azure Synapse Analytics Distributed and hardware accelerated model inference on Spark Understand opaque-box models, measure dataset biases, Explainable Boosting Machines Parse PDFs and translate dataframes between over 100 languages Contextual Bandit Reinforcement Learning with Vowpal Wabbit

New Features

General ✨

ONNX on Spark 🕸

Cognitive Services for Big Data🧠

Responsible AI at Scale 😇

Vowpal Wabbit 🐇

LightGBM 🌳

Build and Infrastructure Improvements 🏭

Additional Updates

Bug Fixes 🐞

Documentation 📘

New Contributor Spotlight

We are excited to welcome several new developers to the SynapseML project.

Serena Ruan Jason Wang Wenqing Xu
Serena is an Engineer on the Azure Synapse team in Beijing. Within her first months working on SynapseML, Serena contributed Forms and Translator cognitive services, a unified logging and telemetry system, notebooks and documentation for every transformer and estimator, and a new docusaurus-based website. Jason is a Principal Engineer on Microsoft's DSP team and is focused on large-scale responsible AI. Jason started his contribution streak with a new API for model explainability that unifies both SHAP and LIME. Jason has also contributed ONNX on Spark which dramatically broadens the scope of models that can be used in SynapseML. Wenqing is a software engineer on the Azure Synapse team in Beijing. Wenqing has been instrumental in preparing SynapseML for General Availability. In particular, Wenqing added support for linked service authentication of cognitive services, extended E2E testing to Synapse Analytics, and added the PII identification service.
Kashyap Patel Rohit Agrawal Jack Gerrits
Kashyap is an Engineer on Microsoft's DSP team working on improving the fairness of machine learning models. Kashyap contributed tools for assessing dataset bias without requiring a labelled dataset or model. Rohit is a Senior Engineer on Microsoft's Cognitive Service team working on large-scale orchestration of intelligent services. Rohit modernized our Text Analytics Stack by updating to v3.0 and laid the groundwork for E2E testing on Synapse Analytics. Jack is a Senior Engineer on the decision service and reinforcement learning team at Microsoft Research NYC. Jack contributed support for contextual bandit reinforcement learning with Vowpal Wabbit.

Acknowledgements

We would like to acknowledge the developers and contributors, both internal and external who helped create this version of SynapseML

Jason Wang, Serena Ruan, Ilya Matiach, Jack Gerrits, Kashyap Patel, Wenqing Xu, Markus Weimer, Jeff Zheng, Nellie Gustafsson, Ruixin Xu, Martha Laguna, Markus Cozowicz, Rohit Agrawal, Daniel Ciborowski, Jako Tinkus, Tom Finley, Tomas Talius, Mitrabhanu Mohanty, Roy Levin, Anand Raman, William T. Freeman, Ryan Hurey, Sharath Chandra, Beverly Kodhek, Assaf Israel, Nisheet Jain, Ryan Hurey, Miguel Fierro, Dotan Patrich, Akshaya Annavajhala (AK), Euan Garden, Lev Novik, Guolin Ke, Tara Grumm, Keunhyun Oh, Vanunts Arsenii, Alexandr Severinov, David Lacalle Castillo, Ryosuke Horiuchi, Ashish Solanki, Matthieu Maitre, ONNX Team, Azure Global, Vowpal Wabbit Team, Light GBM Team, MSFT Garage Team, MSR Outreach Team, Speech SDK Team

Learn More

Visit our new website for the latest docs, demos, and examples Read more about SynapseML in the Microsoft Research Blog Get started with SynapseML on Azure Synapse Analytics
Read the Synapse Analytics Ignite Announcements Read our Paper from IEEE Big Data '21 Watch our ODSC Webinar on working with AI services at scale

相关地址:原始地址 下载(tar) 下载(zip)

查看:2021-11-16发行的版本