Showing 780 open source projects for "python data analysis"

  • 1
    UnionML

    Build and deploy machine learning microservices

    Creating ML apps should be simple and frictionless. UnionML is an open-source Python framework built on top of Flyte™, unifying the complex ecosystem of ML tools into a single interface. Combine the tools that you love using a simple, standardized API so you can stop writing so much boilerplate and focus on what matters: the data and the models that learn from them. Fit the rich ecosystem of tools and frameworks into a common protocol for machine learning.
    Downloads: 0 This Week
  • 2
    Neural Network Visualization

    Project for processing neural networks and rendering to gain insights

    nn_vis is a minimalist visualization tool for neural networks written in Python using OpenGL and Pygame. It provides an interactive, graphical representation of how data flows through neural network layers, offering a unique educational experience for those new to deep learning or looking to explain it visually. By animating input, weights, activations, and outputs, the tool demystifies neural network operations and helps users intuitively grasp complex concepts.
    Downloads: 0 This Week
  • 3
    Twinify

    Privacy-preserving generation of a synthetic twin to a data set

    twinify is a software package for the privacy-preserving generation of a synthetic twin to a given sensitive tabular data set. At a high level, twinify follows the differentially private data-sharing process introduced by Jälkö et al. Depending on the nature of your data, twinify implements either the NAPSU-MQ approach described by Räisä et al. or finds an approximate parameter posterior for any probabilistic model you formulated using differentially private variational inference (DPVI)....
    Downloads: 0 This Week
  • 4
    Wooey

    A Django app that creates automatic web UIs for Python scripts

    ...Enable the easy wrapping of any program in simple Python instead of having to use language specific to existing tools such as Galaxy. Enable fellow lab members with no command line experience to utilize Python scripts. Autodocument workflows for data analysis (simple model saving).
    Downloads: 0 This Week
  • 5
    FairScale

    PyTorch extensions for high performance and large scale training

    FairScale is a collection of PyTorch performance and scaling primitives that pioneered many of the ideas now used for large-model training. It introduced Fully Sharded Data Parallel (FSDP) style techniques that shard model parameters, gradients, and optimizer states across ranks to fit bigger models into the same memory budget. The library also provides pipeline parallelism, activation checkpointing, mixed precision, optimizer state sharding (OSS), and auto-wrapping policies that reduce...
    Downloads: 2 This Week
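The sharding idea in the FairScale description can be illustrated with a toy partitioner. This is only a sketch under stated assumptions: the function name `shard_parameters`, the parameter names, and the greedy "largest first, least-loaded rank" strategy are all hypothetical and are not taken from FairScale's code.

```python
def shard_parameters(param_sizes, world_size):
    """Assign each parameter to one rank so per-rank memory stays balanced.

    param_sizes: dict mapping a (hypothetical) parameter name to its size.
    Returns (shards, loads): the names owned by each rank and each rank's total.
    """
    shards = [[] for _ in range(world_size)]
    loads = [0] * world_size
    # Place the largest parameters first, always on the least-loaded rank.
    for name, size in sorted(param_sizes.items(), key=lambda item: -item[1]):
        rank = loads.index(min(loads))
        shards[rank].append(name)
        loads[rank] += size
    return shards, loads


# Illustrative sizes only.
shards, loads = shard_parameters(
    {"embed": 8, "layer1": 4, "layer2": 4, "head": 2}, world_size=2
)
```

Each rank then only needs to hold the state for the parameters it owns, which is the memory saving the description refers to.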
  • 6
    Name-That-Hash

    Identify MD5, SHA256 and 300+ other hashes

    ...One of its core ideas is popularity-aware ranking: when you feed in a hash, it prioritizes likely real-world types such as NTLM over obscure ones like Skype hashes, instead of treating them equally. The tool provides concise “hash summaries” that explain where a given hash format is commonly used, helping users decide how to proceed with cracking or further analysis. Name-That-Hash is accessible via a Python CLI (nth) and also exposes an API and JSON output, making it easy to plug into other tools or workflows, and there is also a web app that requires no local installation.
    Downloads: 0 This Week
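Popularity-aware ranking, as described for Name-That-Hash, can be sketched in a few lines. The signature table, the popularity scores, and the `identify` function below are illustrative assumptions, not the tool's actual data or API.

```python
import re

# Hypothetical signature table: (name, pattern, popularity score).
# The scores are made up for illustration.
SIGNATURES = [
    ("MD5", re.compile(r"[0-9a-fA-F]{32}"), 100),
    ("NTLM", re.compile(r"[0-9a-fA-F]{32}"), 90),
    ("MD4", re.compile(r"[0-9a-fA-F]{32}"), 10),
    ("SHA-1", re.compile(r"[0-9a-fA-F]{40}"), 95),
    ("SHA-256", re.compile(r"[0-9a-fA-F]{64}"), 95),
]


def identify(hash_text):
    """Return the names of all matching hash types, most popular first."""
    candidates = [
        (popularity, name)
        for name, pattern, popularity in SIGNATURES
        if pattern.fullmatch(hash_text.strip())
    ]
    # Descending popularity puts likely real-world types ahead of obscure ones.
    return [name for popularity, name in sorted(candidates, key=lambda c: -c[0])]


print(identify("5d41402abc4b2a76b9719d911017c592"))  # ['MD5', 'NTLM', 'MD4']
```

Several formats share the same shape (32 hex characters here), so the ranking rather than the pattern match decides which guess is shown first.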
  • 7
    agglo

    Multi-facet CASE/AGL for easy project development

    Agglo is a CASE (Computer Aided Software Engineering) tool, or AGL (Atelier Génie Logiciel), that aims to simplify the installation and use of various tools (either existing open-source tools or Agglo's own tools) across several facets of project development: requirements management, unit tests, automated integration tests, and a toolkit for various languages (cpp, c, python, shell, xsl). Further facets are planned: coverage tests, planning, integration with Hudson, code quality analysis, and indicator production.
    Downloads: 0 This Week
  • 8
    codecat

    static code analysis, to find/track sinks and bugs

    CodeCat is an open-source tool for static code analysis that helps you find and track sinks and bugs; these points are matched by regex rules...
    Downloads: 0 This Week
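Regex-driven sink detection of the kind CodeCat describes can be approximated as follows. The rule names, the patterns, and the `scan` function are hypothetical examples, not CodeCat's own rule set or interface.

```python
import re

# Hypothetical rules mapping a rule name to a pattern for a risky call.
RULES = {
    "eval-call": re.compile(r"\beval\s*\("),
    "os-system": re.compile(r"\bos\.system\s*\("),
}


def scan(source):
    """Return (line number, rule name, stripped line) for every rule match."""
    findings = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        for rule_name, pattern in RULES.items():
            if pattern.search(line):
                findings.append((lineno, rule_name, line.strip()))
    return findings


sample = "import os\nx = eval(user_input)\nos.system(cmd)\n"
for finding in scan(sample):
    print(finding)
```

A line-by-line regex scan is cheap but purely textual, so it will also flag matches inside comments and strings.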
  • 9
    Jraph

    A Graph Neural Network Library in Jax

    Jraph (pronounced “giraffe”) is a lightweight JAX library developed by Google DeepMind for building and experimenting with graph neural networks (GNNs). It provides an efficient and flexible framework for representing, manipulating, and training models on graph-structured data. The core of Jraph is the GraphsTuple data structure, which enables users to define graphs with arbitrary node, edge, and global attributes, and to batch variable-sized graphs efficiently for JAX’s just-in-time...
    Downloads: 0 This Week
  • 10
    Blend_My_NFTs

    Easily generate thousands of 3D models, images, and animation NFTs

    Blend_My_NFTs is an open-source, free-to-use Blender add-on that enables you to easily generate thousands of 3D Models, Animations, and Images. This add-on's primary purpose is to aid in the creation of large generative 3D NFT collections. It is the first and easiest 3D NFT generator. Blend_My_NFTs was initially developed to create Cozy Place, an NFT collection by This Cozy Studio Inc. Blend_My_NFTs works with Blender 3.2.2 on Windows 10 or macOS Big Sur 11.6. Linux is supported, however we...
    Downloads: 0 This Week
  • 11
    Grow.dev

    A declarative website generator designed for high-quality websites

    Grow.dev is a static site generator optimized for building highly interactive, localized microsites. Grow.dev focuses on providing optimal workflows and developer ergonomics for creating projects that are highly maintainable in the long term. Grow.dev encourages a strong but simple separation of content and presentation and makes maintaining content in different locales and environments a snap.
    Downloads: 1 This Week
  • 12
    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...
    Downloads: 0 This Week
  • 13
    Whisper Library

    Whisper is a file-based time-series database format for Graphite

    Whisper is one of three components within the Graphite project. Whisper is a fixed-size database, similar in design and purpose to RRD (round-robin-database). It provides fast, reliable storage of numeric data over time. Whisper allows for higher resolution (seconds per point) of recent data to degrade into lower resolutions for long-term retention of historical data. One of its utilities copies data from src to dst where it is missing and, unlike whisper-merge, does not overwrite data that's already present in the target...
    Downloads: 3 This Week
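The way recent high-resolution data degrades into lower resolutions can be pictured as a roll-up over fixed-size groups of points. The sketch below assumes averaging as the aggregation and uses a hypothetical `downsample` helper; it does not read or write Whisper files.

```python
def downsample(points, factor):
    """Average each run of `factor` consecutive points into one coarser point."""
    coarse = []
    for start in range(0, len(points), factor):
        group = points[start:start + factor]
        coarse.append(sum(group) / len(group))
    return coarse


# Six 10-second samples rolled up into two 30-second samples.
print(downsample([1, 2, 3, 4, 5, 6], factor=3))  # [2.0, 5.0]
```

Because each coarser archive holds fewer points for the same time span, total storage stays fixed while older data loses detail.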
  • 14
    Blankly

    Easily build, backtest and deploy your algo in just a few lines

    Blankly is a live trading engine, backtest runner and development framework wrapped into one powerful open-source package. Models can be instantly backtested, paper traded, sandbox tested and run live by simply changing a single line. We built Blankly for every type of quant including training & running ML models in the same environment, cross-exchange/cross-symbol arbitrage, and even long/short positions on stocks (all with built-in WebSockets). Blankly is the first framework to enable...
    Downloads: 2 This Week
  • 15
    m2cgen

    Transform ML models into native code

    m2cgen (Model 2 Code Generator) is a lightweight library that provides an easy way to transpile trained statistical models into native code (Python, C, Java, Go, JavaScript, Visual Basic, C#, PowerShell, R, PHP, Dart, Haskell, Ruby, F#, Rust, Elixir) with zero dependencies. Some models force input data to be a particular type during the prediction phase in their native Python libraries. ...
    Downloads: 0 This Week
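To make "transpiling a model into native code" concrete, here is a minimal hand-written sketch for a linear model. The function `linear_model_to_python` and its output format are hypothetical and do not reproduce m2cgen's API or generated code.

```python
def linear_model_to_python(coefficients, intercept, function_name="score"):
    """Emit dependency-free Python source that evaluates a linear model."""
    terms = " + ".join(
        f"({coef!r}) * features[{index}]"
        for index, coef in enumerate(coefficients)
    )
    body = f"{intercept!r} + {terms}" if terms else repr(intercept)
    return f"def {function_name}(features):\n    return {body}\n"


# Generate the source, then load it to show that it runs on its own.
source = linear_model_to_python([0.5, -2.0], 1.0)
namespace = {}
exec(source, namespace)
print(source)
print(namespace["score"]([2.0, 1.0]))  # 0.0
```

The generated function needs neither the training library nor the model object at prediction time, which is the point of the zero-dependency claim.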
  • 16
    Google Cloud Vision API examples

    Sample code for Google Cloud Vision

    The cloud-vision repository is a sample code collection for the Google Cloud Vision API that shows developers how to implement image analysis tasks across a wide range of languages and platforms. It contains examples organized by language and environment, including Go, Java, Node.js, PHP, Python, Ruby, .NET, Android, iOS, and even a Chrome extension, which makes it especially valuable as a cross-platform learning resource. The repository demonstrates concrete image understanding use cases, such as landmark detection and mobile photo analysis with label and face detection, so developers can see how Vision API outputs are consumed in real interfaces and workflows. ...
    Downloads: 0 This Week
  • 17
    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
  • 18
    Binarytree

    Python library for studying Binary Trees

    Binarytree is a Python library that lets you generate, visualize, inspect and manipulate binary trees. Skip the tedious work of setting up test data, and dive straight into practicing algorithms. Heaps and BSTs (binary search trees) are also supported. Binarytree supports another representation which is more compact but without the indexing properties.
    Downloads: 0 This Week
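A list-based tree representation with indexing properties is commonly laid out in level order, with the children of index i at 2i + 1 and 2i + 2. The sketch below assumes that layout and uses its own minimal `Node` class and helper names rather than the Binarytree library's API.

```python
class Node:
    """Minimal stand-in node; not the library's own class."""

    def __init__(self, value):
        self.value = value
        self.left = None
        self.right = None


def build_from_list(values):
    """Build a tree from a level-order list where None marks a missing node."""
    nodes = [None if value is None else Node(value) for value in values]
    for index, node in enumerate(nodes):
        if node is None:
            continue
        left, right = 2 * index + 1, 2 * index + 2
        if left < len(nodes):
            node.left = nodes[left]
        if right < len(nodes):
            node.right = nodes[right]
    return nodes[0] if nodes else None


def inorder(node):
    """Return the values of the tree in left-root-right order."""
    if node is None:
        return []
    return inorder(node.left) + [node.value] + inorder(node.right)


root = build_from_list([4, 2, 6, 1, 3, None, 7])
print(inorder(root))  # [1, 2, 3, 4, 6, 7]
```

The None placeholders are what preserve the index arithmetic; a more compact form that drops them loses the ability to compute a child's position directly.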
  • 19
    AWS Step Functions Data Science SDK

    For building machine learning (ML) workflows and pipelines on AWS

    The AWS Step Functions Data Science SDK is an open-source library that allows data scientists to easily create workflows that process and publish machine learning models using Amazon SageMaker and AWS Step Functions. You can create machine learning workflows in Python that orchestrate AWS infrastructure at scale, without having to provision and integrate the AWS services separately.
    Downloads: 2 This Week
  • 20
    Manticore

    Symbolic execution tool

    Manticore helps us quickly take advantage of symbolic execution, taint analysis, and instrumentation to analyze binaries. Parts of Manticore underpinned our symbolic execution capabilities in the Cyber Grand Challenge. As an open-source tool, we hope that others can take advantage of these capabilities in their own projects. We prioritized simplicity and usability while building Manticore. We used minimal external dependencies and our API should look familiar to anyone with exploitation or...
    Downloads: 0 This Week
  • 21
    Model Search

    Framework that implements AutoML algorithms

    Model Search is an AutoML research system for discovering neural network architectures with minimal human intervention. Instead of hand-crafting models, you define a search space and objectives, then the system explores candidate architectures using controllers and population-based strategies. It supports multiple tasks (such as vision or text) by letting you express reusable building blocks—layers, cells, and topologies—that the search can recombine. Training, evaluation, and promotion of...
    Downloads: 0 This Week
  • 22
    Smart Contract Sanctuary

    A home for Ethereum smart contracts

    ...Contains smart contract sources for various networks, grouped by the first two chars of the contract address. A scriptable semantic grep utility for Solidity (crunch numbers, find specific contracts, extract data). Semgrep is a fast, open-source static analysis tool for finding bugs and enforcing code standards at editor, commit, and CI time, and now supports Solidity! A powerful online code search service that can be used to search the sanctuary without cloning.
    Downloads: 0 This Week
  • 23
    q - Text as Data

    Run SQL directly on CSV or TSV files

    q is a command line tool that allows direct execution of SQL-like queries on CSVs/TSVs (and any other tabular text files). q treats ordinary files as database tables, and supports all SQL constructs, such as WHERE, GROUP BY, JOINs etc. It supports automatic column name and column type detection, and fully supports multiple encodings. Use -e data-encoding to set the input data encoding, -Q query-encoding to set the query encoding, and use -E...
    Downloads: 4 This Week
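Treating a delimited text file as a database table can be sketched with Python's standard library alone. The `query_csv` helper, the table name `data`, and the sample rows are assumptions made for illustration; this is not q's command line interface.

```python
import csv
import io
import sqlite3


def query_csv(csv_text, sql, table="data"):
    """Load CSV text into an in-memory SQLite table and run one SQL query."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, records = rows[0], rows[1:]
    connection = sqlite3.connect(":memory:")
    # Column names come from the header row; values are stored as text.
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    connection.execute(f'CREATE TABLE "{table}" ({columns})')
    connection.executemany(
        f'INSERT INTO "{table}" VALUES ({placeholders})', records
    )
    result = connection.execute(sql).fetchall()
    connection.close()
    return result


csv_text = "city,amount\nOslo,3\nRome,5\nOslo,4\n"
totals = query_csv(
    csv_text,
    "SELECT city, SUM(CAST(amount AS INTEGER)) FROM data "
    "GROUP BY city ORDER BY city",
)
print(totals)  # [('Oslo', 7), ('Rome', 5)]
```

This sketch does no column type detection, which is why the query casts `amount` explicitly before summing.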
  • 24
    StreamAlert

    StreamAlert is a serverless, realtime data analysis framework

    ...Secure by design, least-privilege execution, containerized analysis, and encrypted data storage.
    Downloads: 0 This Week
  • 25
    Interpret-Text

    State-of-the-art explainers for text-based machine learning models

    A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard. Interpret-Text builds on Interpret, an open source Python package for training interpretable models and helping to explain blackbox machine learning systems. We have added extensions to support text models. Interpret-Text incorporates community-developed interpretability techniques for NLP models and a visualization dashboard to view the results. Users can run their experiments across multiple state-of-the-art explainers and easily perform comparative analysis on them. ...
    Downloads: 0 This Week