OSS Business Incubator

Supported Projects (Past & Present)
Numpy

NumPy

NumPy is an open source project aiming to enable numerical computing with Python. It was created in 2005, building on the early work of the Numeric and Numarray libraries. NumPy will always be 100% open source software, free for all to use and released under the liberal terms of the modified BSD license. NumPy is developed in the open on GitHub, through the consensus of the NumPy and wider scientific Python community. For more information on our governance approach, please see our Governance Document.

xnd

XND

We are building XND to recreate the foundations of NumPy as a number of smaller libraries, combining the lessons learned in the past twenty years of array computing in Python with the needs of newer applications. This is not a replacement of NumPy. Eventually, NumPy could use XND as could Pandas, Dask, and other libraries. In fact, we are actively working on using XND in Numba and are also very interested in integrating it with other libraries.

dask

Dask

Dask provides advanced parallelism for analytics, enabling performance at scale. It’s a fiscally sponsored project of NumFOCUS, a nonprofit dedicated to supporting the open source scientific computing community. It’s open source. It’s developed projects like NumPy, pandas, and scikit-learn.
holoviews logo

HoloViews

HoloViews is an open-source Python library designed to make data analysis and visualization seamless and simple. With HoloViews, you can usually express what you want to do in very few lines of code.

pandas logo

Pandas

Pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool.

castra logo

Castra

Web application RPC library for Clojure/Script and Ring. It has great features. 

PyData logo

PyData

PyData is an educational program of NumFOCUS, a 501(c)(3) nonprofit charity.

tensorflow logo

TensorFlow

TensorFlow is an end-to-end open source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries and community resources that lets researchers push the state-of-the-art in ML and developers easily build and deploy ML powered applications.

blosc logo

Blosc

Blosc is a rapid binary data compressor that optimizes fetching data to the processor cache, outperforming traditional memory access methods. It reduces large datasets on disk or in memory and accelerates memory-intensive tasks like vector operations. 

conda

CONDA

Package, dependency and environment management for any language—Python, R, Ruby, Lua, Scala, Java, JavaScript, C/ C++, FORTRAN, and more. Conda is an open source package management system and environment management system that runs on Windows, macOS and Linux. Conda quickly installs, runs and updates packages and their dependencies. Conda easily creates, saves, loads and switches between environments on your local computer. It was created for Python programs

DyND logo

DyND

DyND is a C++ library for dynamic, multidimensional arrays. It is inspired by NumPy, the Python array programming library at the core of the scientific Python stack, but tries to address a number of obstacles encountered by some of its users. Examples of this are support for variable-sized string, ragged array types, and convenient usage from C++. The library is in a preview development state, and can be thought of as a sandbox.

ibis logo

Ibis

Ibis is a toolbox to bridge the gap between local Python environments (like pandas and scikit-learn) and remote storage and execution systems like Hadoop components (like HDFS, Impala, Hive, Spark) and SQL databases (Postgres, etc.). Its goal is to simplify analytical workflows and make you more productive.

Scipy

SciPy

SciPy (pronounced “Sigh Pie”) is a Python-based ecosystem of open-source software for mathematics, science, and engineering.

QHub logo

QHub

QHub enables teams to build and maintain a cost-effective and scalable compute/data science platform in the cloud or on-premises. 

Zarr logo

Zarr

Zarr is a Python package providing an implementation of compressed, chunked, N-dimensional arrays, designed for use in parallel computing.

blaze logo

DataShape

DataShape is a language for describing data. It is an extension of the NumPy dtype with an emphasis on cross language support.

ipython logo

IPython

A powerful interactive shell. A kernel for Jupyter. Support for interactive data visualization and use of GUI toolkits. Flexible, embeddable interpreters to load into your own projects. Easy to use, high performance tools for parallel computing.

holoviz logo

HoloViz

High-level tools to simplify visualization in Python.

scikit

SCIKIT Learn

Simple and efficient tools for predictive data analysis · Accessible to everybody, and reusable in various contexts · Built on NumPy, SciPy, and matplotlib. This project was started in 2007 as a Google Summer of Code project by David Cournapeau. Later that year, Matthieu Brucher started work on this project as part of his thesis. In 2010 Fabian Pedregosa, Gael Varoquaux, Alexandre Gramfort and Vincent Michel of INRIA took leadership of the project and made the first public release, February the 1st 2010. 

altair logo

Altair

Altair is a declarative statistical visualization library for Python, based on Vega and Vega-Lite, and the source is available on GitHub. With Altair, you can spend more time understanding your data and its meaning. Altair’s API is simple, friendly and consistent and built on top of the powerful Vega-Lite visualization grammar. This elegant simplicity produces beautiful and effective visualizations with a minimal amount of code.

bokeh logo

Bokeh

Bokeh is a Python library for creating interactive visualizations for modern web browsers. It helps you build beautiful graphics, ranging from simple plots to complex dashboards with streaming datasets. With Bokeh, you can create JavaScript-powered visualizations without writing any JavaScript yourself.

pytorch

PyTorch

An open source machine learning framework that accelerates the path from research prototyping to production deployment. PyTorch, the PyTorch logo and any related marks are trademarks of Facebook, Inc.

conda_forge logo

CONDA-FORGE

A community-led collection of recipes, build infrastructure and distributions for the conda package manager.

 

jitsi logo

jitsi.org

A a free open-source video conferencing software for web & mobile

panel logo

Panel

A high-level app and dashboarding solution for Python.
 

pyviz logo

PyViz

An open platform for helping users decide on the best open-source (OSS) Python data visualization tools for their purposes, with links, overviews, comparisons, and examples.
datashader

Datashader

Datashader is a graphics pipeline system for creating meaningful representations of large datasets quickly and flexibly. Datashader breaks the creation of images into a series of explicit steps that allow computations to be done on intermediate representations. This approach allows accurate and effective visualizations to be produced automatically without trial-and-error parameter tuning, and also makes it simple for data scientists to focus on particular data and relationships of interest in a principled way.

spyder logo

SPYDER

Spyder is a free and open source scientific environment written in Python, for Python, and designed by and for scientists, engineers and data analysts. It features a unique combination of the advanced editing, analysis, debugging, and profiling functionality of a comprehensive development tool with the data exploration, interactive execution, deep inspection, and beautiful visualization capabilities of a scientific package.

 

numba logo

Numba

Numba translates Python functions to optimized machine code at runtime using the industry-standard LLVM compiler library. Numba-compiled numerical algorithms in Python can approach the speeds of C or FORTRAN. You don’t need to replace the Python interpreter or run a separate compilation step

Jupyter logo

Jupyter

Project Jupyter exists to develop open-source software, open-standards, and services for interactive computing across dozens of programming languages. Supporting Jupyter Lab and Jupyter Notebooks.

project_odo logo

Project_odo

Data migration between different storage systems.

python logo

Python

Python is a powerful, fast, scalable, firendly, easy to learn, and open source programming language.

sympy logo

SymPy

SymPy is a Python library for symbolic mathematics. It aims to become a full-featured computer algebra system (CAS) while keeping the code simple.

matplotlib logo

Matplotlib

Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.