Unveiling the VENum Stack
In a significant advancement for data security and machine learning, Vaultree has unveiled the VENum Stack (Vaultree Encrypted Numbers), the community edition of an open-source framework that enables developers to perform machine learning (ML) on encrypted data without compromising performance or security. This release addresses longstanding challenges in processing sensitive information securely, marking a pivotal moment in the fields of cryptography and artificial intelligence. Vaultree invites AI & ML professionals to contribute to the library.
The Challenge of Secure Data Processing
Traditionally, analysing sensitive data with AI & ML models necessitated decryption, exposing information to potential breaches and complicating compliance with data privacy regulations. While Fully Homomorphic Encryption (FHE) allows computations on encrypted data, existing FHE schemes often suffer from scalability and performance limitations, hindering their practical application in real-world scenarios.
Introducing the VENum Stack
Vaultree’s VENum Stack comprises two core components:
- VENumpy: An internal FHE library that facilitates secure and scalable ML operations on encrypted data.
- VENumML: A Python library built upon Vaultree’s proprietary encryption scheme, designed to integrate ML capabilities with FHE seamlessly.
By combining these tools, the VENum Stack empowers developers, regardless of their cryptographic expertise, to execute advanced ML tasks securely, ensuring data privacy without sacrificing performance.
Key Features of VENumML
VENumML offers a range of functionalities tailored for encrypted data processing:
- Linear Models: Implementations of linear and logistic regression, optimised using stochastic gradient descent (SGD).
- Time Series Analysis (Phineus): Tools for Fast Fourier Transform (FFT) and gradient descent on encrypted time series data.
- Deep Learning: Support for transformer architectures, with features for facial recognition applications.
- Graph Analysis: Planned inclusion of algorithms like PageRank for encrypted graph data analysis.
These features enable the processing of various data formats—including images, tabular data, unstructured data, graphs, and time series—while maintaining encryption throughout the ML pipeline.
Open-Source Commitment and Community Engagement
Vaultree has open-sourced a community edition of VENumML to foster innovation, transparency, and collaboration within the developer community. By providing access to this technology, Vaultree encourages developers and researchers to contribute to the evolution of privacy-preserving ML applications and support on essential advancements in this field and Vaultree's upcoming powerful releases.
The community edition of the VENumML repository is available on GitHub: https://github.com/Vaultree/VENumML
Developers can explore the codebase, access documentation, and participate in ongoing discussions to enhance the framework’s capabilities.
Real-World Applications and Industry Impact
Vaultree’s VENum Stack is more than a technological advancement—it’s a practical solution to some of the most pressing challenges in data-intensive industries like financial services and healthcare. By enabling secure, encrypted machine learning, VENum empowers organisations to innovate responsibly while adhering to stringent privacy regulations. Below two examples of possible use cases:
Financial Services: Optimising Cash Management with Secure Forecasting
Managing sensitive data, like ATM transaction volumes, has traditionally required decryption, exposing organisations to risks of data breaches and regulatory penalties. With VENumML, Vaultree’s Phineus module introduces a groundbreaking solution: Privacy-preserving time series forecasting.
Using advanced techniques like Fourier Transforms and Linear Regression on encrypted data, Phineus enables financial institutions to:
- Predict ATM cash needs while keeping transaction data encrypted.
- Reduce operational costs and downtime by optimising cash replenishment schedules.
- Comply seamlessly with privacy regulations like GDPR and the Gramm-Leach-Bliley Act.
By securely analysing sensitive data, financial organisations can enhance decision-making without compromising on privacy.
Healthcare: Diagnosing Rare Diseases with Encrypted Transformers
Healthcare providers face a critical need to analyse sensitive patient data while maintaining strict compliance with regulations like HIPAA. Vaultree’s VENumML demonstrated its capability to address this with encrypted transformer models for natural language processing (NLP).
In a healthcare-focused demo, encrypted patient medical records were processed to identify rare diseases, such as Wilson’s disease. Key innovations included:
- Data Encryption via VENumpy, ensuring patient information remains secure throughout preprocessing, tokenization, and inference.
- Transformer Architectures are fine-tuned to analyse encrypted inputs, enabling advanced NLP tasks.
- Encrypted Inference to securely generate diagnostic predictions without exposing sensitive data.
This approach empowers healthcare providers to securely leverage cutting-edge ML tools, unlocking the potential for AI-driven medical innovation at scale.
Future Developments
Vaultree is committed to expanding the capabilities of the VENum Stack. Upcoming releases aim to include additional ML models and support for a broader range of data types, further empowering developers to create secure, privacy-preserving applications across various domains.
Conclusion
Vaultree’s introduction of the VENum Stack represents a significant leap forward in the integration of machine learning and data encryption. By open-sourcing VENumML, Vaultree not only provides a powerful tool for secure data processing but also invites the global developer community to participate in shaping the future of privacy-preserving machine learning.
Ready to Transform Your Data Security?
Discover how VEDS can revolutionise your data sharing and collaboration. Contact us today to learn more or request a demo.