Insights on AI Cloud Orchestration

Amar Kapadia

Jan 29

·

5

read

From RAGs to Riches

Find out more

A Unique RAGOps Opportunity for NSPs to Offer RAG to their Enterprise Customers

Enterprises are going to embrace GenAI, of that there is no doubt. GenAI will add value in just about every function of an enterprise. The speed at which an enterprise adopts GenAI will clearly result in a competitive advantage. However, a more durable and lasting competitive moat will result by blending enterprise data with the GenAI model. The more data an enterprise can utilize for GenAI, the deeper their competitive moat.
‍

There are two options for an enterprise to mix corporate data with the GenAI model:

Fine tune an existing Foundational Model: In this option, an enterprise fine tunes a private copy of an existing GenAI Foundational Model (FM) with their own corporate data. Though much simpler than training a new GenAI model, which we are not even considering, this option is difficult for most enterprises. It requires GPUs in the tune of $Ms, a high degree of skill set to set up Large Language Model Operations (LLMOps) pipelines, and the need to continuously fine tune the model to prevent it from drifting or getting stale.
Retrieval Augmented Generation (RAG): In this approach, an enterprise uses a lightweight Foundational Model (FM) that has generic natural language processing capability but no real domain knowledge. Users will then supplement the prompt with real time augmented data to get a meaningful result. Finally, RAG can also prevent hallucination by citing the exact data source(s). However, this approach is network heavy in that with each prompt there may be a large amount of traffic to retrieve the relevant data.

In that sense the two approaches are analogous to the following images:

*Fine tuning an FM is akin to tapping into an* ***intelligent employee who has been fully trained in your corporate data****. Of course, they need to be trained on an ongoing basis to stay current.*

‍

*RAG is similar to hiring an* ***intelligent employee/consultant who doesn’t have prior knowledge of any specific domain****, but is fast enough to read any information you want in real-time.*

‍

Given the above: Most enterprises will use RAG

‍

There are three deployment models for RAG:

Public model – In this option, a public model e.g. Microsoft is used for RAG. The public model will use corporate data to provide the response. The fly in the ointment is the requirement to move all the relevant data to a public GenAI service provider. Some enterprises might be comfortable with this but most will not be for a variety of reasons.
Private model in a public cloud – In this approach, an enterprise uses a private FM in a public cloud along with other components such as vector databases. This is convenient but again, all the data needs to be shipped to the public cloud. This is perhaps less scary than the previous option since the data would reside in a private repository; nevertheless, it is a lot to swallow.
Private model in a private cloud – In this option, the enterprise would use a private FM along with other components like a vector database in a private cloud. What makes this approach attractive is that the private cloud already has all the required network connections to internal data sources. However, this approach does require a bit more sophistication on the part of the user to deploy and manage RAG.
‍

From the above, it is clear: A RAG model in a private cloud will dominate

‍

Enter Network Service Providers (NSP)

Unlike ML/LLMOps which require significant ML expertise, RAG does not. In fact, RAG requires expertise in data connectivity since the value of a RAG model is directly proportional to the amount of corporate data made available to it. Who better to provide managed RAG than the provider of SD-WAN and managed IP networks?
‍

NSPs are best positioned to offer managed RAG

Getting Started with RAGOps

RAGOp may be summed up as DevOps based methodology to deploy and manage a RAG model. RAGOps requires the following steps:

To expand a bit more:

Deploy virtual infrastructure with GPUs to host the RAG model. This may be a combination of virtual compute (containers, VMs), storage, virtual networks, and Kubernetes/hypervisor layer.
Deploy an FM along with a vector database, text embedding, and other data sources.
Deploy supporting guardrail/management/monitoring components.
Set up data pipelines to collect Enterprise data from diverse sources and populate the vector database.
Monitor and manage (upgrade, scale, troubleshoot) the environment over Days 1,2 as needed.

Since NSPs can provide data connectivity, they hold a competitive advantage. However, the competitive advantage NSPs hold will not last forever. For this reason:
‍

NSPs need to start RAGOps PoCs for enterprise customer ASAP

‍

Next Steps

Contact us for help on getting started with RAGOps.

The Aarna.ml Multi Cluster Orchestration Platform orchestrates and manages edge environments including support for RAGOps. We have specifically created an offering that is suitable for NSPs by focusing not just on the FM and related ML components, but also on the infrastructure e.g. using Equinix Metal to speed up deployment and Equinix Fabric for seamless data connectivity. As an NVidia partner, we have deep expertise with server platforms like the NVidia GraceHopper and platform components such as NVidia Triton and NeMo.

Amar Kapadia

Jan 4

·

2

read

Aarna.ml 2023 Highlights and What’s to Come

Find out more

Happy New Year to all! Hope 2023 was a good year for you and 2024 will be even better.

For Aarna, we closed 2023 with several major accomplishments:

Pushed our flagship product, Aarna.ml Multi Cluster Orchestration Platform (AMCOP), into production
Established ourselves as #1 Private 5G orchestrator (successfully implemented E2E P5G solution in partnership with Druid and Airspan)
Established ourselves as #1 ML + O-RAN SMO through collaboration with NVidia and interop exercises at TIP, Digital Catapult, i14y, O-RAN Plugfests
Released a beta version of our cloud edge orchestration SaaS product, Aarna Edge Services (AES), initially targeting the Cloud Adjacent Storage use case followed by Cloud Adjacent GenAI (RAG), edge⇔multi cloud networking, and more
Recognized for contributions to Linux Foundation project, including Nephio (where we are the #3 contributor), Akraino, and 5G SuperBlueprint
Hired 15+ new staff
Raised Series A financing
‍

For 2024, here’s what we are working on:

Solidifying our position in Private 5G
Building on our edge ML work to establish ourselves as the #1 Edge ML orchestration company
Providing comprehensive cloud edge orchestration features to cover storage, networking, and GenAI/ML use cases
Expanding Nephio to a number of enterprise use cases by collaborating with other open source communities such as Kubernetes, OpenTofu, Ansible, LF AI & Data Foundation and more
‍

If you are looking to join our journey (as a customer, investor, partner, advisor, employee, press/analyst, or other) please reach out to us.

Regards,

Amar & Sriram

Brandon Wick

Dec 13

·

3

read

Getting Started in GenAI with a Private, Zero-Trust, Fully Managed LLM

Find out more

In June of 2023, we announced a partnership with Predera to offer a packaged Generative AI solution to the industry — a private, and fully managed LLM with Predera’s AIQ MLOps Platform modern toolstack and Aarna.ml AMCOP for zero touch orchestration, configuration, management of upgrades/updates.

We’re now pleased to announce that NetFoundry has been added to this offering, bringing a zero trust security approach, built on Ziti (which comes in both open source OpenZiti or CloudZiti SaaS). Hosted in an Equinix data center, all connections are made using software-only zero trust endpoints, using outbound connections and ‘authenticate-before-connect’ making it ‘dark’ to the internet with no inbound ports. This provides a security posture beyond those of Managed Commercial solutions, with a user experience as simple as ‘it just being available on the internet’. Get the Solution Brief.

This significantly strengthens security and controls as unauthorized attackers have no network access by which to exploit the data. The solution also includes, built-in identity, authentication and authorization, least privilege access, granular visibility, and audit controls.

The GenAI offering also provides resources such as Intel Xeon 6338 processors and NVidia A100, hosted on Equinix Metal. Users can choose between LLMs like Llama, Dolly, or NeMo, support services for model fine-tuning and operations, a management dashboard, and now a zero trust security overlay.

Businesses today need to move fast and start taking advantage of the GenAI revolution while avoiding security threats and bogging down LLM adoption with overly-complex configurations. Get the Solution Brief to learn more and start building your own private, zero trust, fully managed, LLM for GenAI.

To better understand zero trust security, please see these assets from NetFoundry:

Yogendra Pal

Dec 13

·

read

ONE Summit Regional Day: AI Networks, Private 5G, and more

Find out more

^{Blog recap by Yogendra Pal and Sriram Rupanagunta}

‍LF Networking’s ONE Summit Regional Day was hosted by Infosys, on Nov 30th, at their beautiful campus in Bengaluru, India. As part of this event, Sriram Rupanagunta was asked to present his thoughts around “AI for Networks and Networks for AI” as a panelist along with other technical leaders in this space. This was very much aligned with the event theme “AI powered Networks”. The summit also focused on various technical talks around LFN open source projects and how these are enablers for enterprise, 5G and other use cases. Yogendra Pal presented insights on “The LFN 5G Super Blueprint: Overview & AI Exploration”. We would like to thank LFN and Infosys for their invitation and congratulate them on hosting this great event to foster open source networking collaboration in India.

**Insights on the “*AI for Networks and Networks for AI”* panel**

In our experience, the use cases for AI in networking are mainly in the Troubleshooting and Optimization areas, which include examples such as RAN failure detection (using RAN data & weather data), and Slice optimization
Some of the challenges being faced are the availability of real life data for training the models, and converting POCs to production deployments (by proving the effectiveness of the predictions)
In terms of the Infrastructure requirements for running these AI workloads, in the recent past, our NVIDIA GTC demo was showcased, wherein RAN & AI workloads are orchestrated and switched on the same hardware with potential use cases for workload and compute optimization. This leads to another challenge in orchestrating the necessary infrastructure and switching the workloads, where the LFN projects such as Nephio will play a key role
In terms of trends using GenAI for the networking and for Enterprises, there is increasing usage of retrieval-augmented generation (RAG) for augmenting the model, which reduces the need for training the models using enterprise data or tweaking the LLMs
The format of the data, which is another challenge, needs standards such as ORAN (for providing data in a vendor-neutral form)

**Insights on “*The LFN 5G Super Blueprint: Overview & AI Exploration”* presentation**

The slides from this session are available here.

The 5G Super Blueprint ecosystem and landscape are shown below, wherein various open source components have been part of continuous development in the orchestration space inclusive of Nephio, EMCO, and ONAP to name a few.

LF Networking has recently compiled a list of on-going and completed blueprints into a 5G Super Blueprint library, which helps to enable enterprise and Private 5G networks use cases across the ecosystem:

Secure IOT Deployments (SEDIMENT + KubeArmor)
Simplified End-to-End Network Slicing (EMCO) led by Aarna.ml
5G End-to-End deployment of eBPF based UPF with Free5GC Core, UE & Radio
L3AF Integration with 5G-UPF
Factory Floor Safety (visual inspection use case)
ORAN SMO Package - Combining multiple open sources into an E2E package
5G Public Cloud Edge Interface (Free5GC Core and UPF deployment orchestration to PCEI using EMCO, demonstrates network slicing led by Aarna.ml
5G Cloud Native Network (demonstrates the value of network slicing using video packet delivery)

In the LFN 5G Super Blueprint, there is ongoing effort towards enterprise to showcase the value of 5G for connectivity and enable various use cases. Aarna.ml has been a key contributor to this effort over the last 5 years providing the orchestration and management for several interactions and keynote demonstrations that have been shown live to thousands of community members. To see our latest contribution in this regard, refer to this presentation from the recent LFN D&TF event: 5G SBP: Orchestration of OAI Core and Amarisoft gNB with EMCO.

Orchestrating various components like O-RAN, 5G core, and edge applications requires an integration of management elements and a 5G strategy. Balancing performance with cost, and integrating multiple technologies into a unified system, are also key concerns.

To address these challenges, Aarna.ml offers AMCOP, an open source orchestration, lifecycle management, real-time policy, and closed loop automation platform for edge and 5G services.

Are you experiencing challenges pulling together a Private 5G Network? Aarna.ml expert team is here to help. Request a free consultation to discuss how to create value with Private 5G for your specific requirements and use cases or request a Free Trial of AMCOP today.

‍

Namachi Sankaranarayanan

Dec 7

·

7

read

Aarna.ml Unveils AMCOP 3.4: Advancing Edge Orchestration by fortifying Security, embracing Standards, and more

Find out more

Aarna.ml today is announcing the release of Aarna.ml Multi Cluster Orchestration Platform (AMCOP) version 3.4, a pivotal milestone in advancing zero-touch edge orchestration. This release introduces a myriad of enhanced features, improvements, and additions, solidifying AMCOP's capabilities in managing complexity at scale.
‍

Role-Based Access Control (RBAC)

RBAC emerges as a linchpin in security, regulating network access based on organizational roles within Service Management and Orchestration (SMO) in the O-RAN architecture. RBAC not only adds an extra layer of security but also efficiently distributes superuser capabilities across administrators through meticulous privilege management.
‍

O1 Functions and NACM

In O-RAN deployments, the sensitivity of O1 functions necessitates adherence to zero-trust principles. The O1 interface, enforcing confidentiality, integrity, authenticity, and least privilege access control through encrypted transport and the Network Configuration Access Control Model (NACM), thus ensur secure network operations. This standards-based mechanism restricts user access to predefined NETCONF operations and content, integrating authentication and authorization seamlessly.
‍

OAuth 2.0 for Access Management

OAuth takes the reins in generating authorization tokens, managing access for distinct roles within the system. This introduction of an authorization layer, separating the client's role from the resource owner's, ensures secure access to protected resources. Utilizing Access Tokens issued by an authorization server, OAuth adheres to industry standards, providing a robust mechanism for secure resource access.
‍

Keycloak for Authentication and Authorization

Keycloak, a robust open-source identity and access management solution, stands as the AAA provider for Aarna SMO. Within Keycloak's administrative realms, the roles, such as 'system-admin,' 'fault-admin,' and 'performance-admin,' define permissions, ensuring secure authentication and authorization for contemporary web applications.
‍

NETCONF Access Control Model (NACM)

NACM, a standardized approach, ensures robust access control mechanisms within the NETCONF Server. Adhering to industry standards outlined in RFC8341, NACM introduces predefined access control groups aligning with distinct NETCONF client roles, prioritizing compatibility, reliability, and adherence to established industry practices.
‍

In this release of AMCOP, ORAN Specified RBAC/Security Requirements as per O-RAN.WG11.Security-Requirements-Specification.O-R003-v06.00 and MPlane O-RU Device Requirements as per specification - O-RAN.WG4.MP.0-R003-v12.00 are met. The solution architecture, as depicted in Figure 1, showcases the implementation of RBAC with users, roles, domains, and policies.

‍

*Solution Architecture of the modules specific to RBAC requirements*

In conclusion, AMCOP v3.4 not only addresses security requirements but also enhances orchestration capabilities. The adoption of industry standards and the meticulous integration of access control mechanisms underscore Aarna.ml commitment to providing users with a secure, interoperable, and globally accepted platform for network orchestration. For more details on device-level access requirements, refer to the O-RAN specifications - O-RAN.WG4.MP.0-R003-v12.00.
‍

This release reaffirms Aarna.ml dedication to innovation, security, and the seamless orchestration of multiple network elements, further solidifying its position as a leader in the evolving landscape of network management and orchestration.

Learn more about ACMOP and request a free trial.

Amar Kapadia

Nov 30

·

read

Private 5G at IEEE Future Networks

Find out more

I was asked to speak at the recent IEEE Future Networks World Forum 2023 event, Nov 13 -15 in Baltimore, MD and hosted virtually on the topic of “Full Stack Automation in 5G & Beyond”. I‘d like to thank IEEE for their invitation and congratulate them on hosting another successful conference.

My talk was focused on orchestrating and automating Private 5G networks built on a foundation of open source projects. Private 5G is a core solution area for Aarna.ml and we've been building the underly architecture and orchestration and management elements for several years. Because no one vendor can do it all, we've been building partnerships across the open networking stack and ensuring that our assets are built on open source, integrated through APIs, and ready to deploy in production environments.

This blog represents a brief recap of my talk; you can also view the slides here.

Understanding Private 5G Networks

Private 5G represents a significant step forward in wireless technology, tailored specifically for business needs. Unlike public networks, it offers enhanced control, superior security, and the ability to be customized. Its strengths lie in its low latency, high bandwidth, and capacity to support numerous devices, making it well-suited for applications with demanding network requirements. Private 5G is a tool for businesses to elevate their operational capabilities, particularly in areas of data management and communication.

Private 5G Architecture using an AMCOP Orchestrator

Key Requirements for Private 5G Success

In order for Private 5G to achieve mass adoption, I believe it requires 3 things:

Zero Touch: Private 5G needs to be extremely easy to use
Cost Effective: Over time, private 5G needs to be in the <$0.50/sq. ft. CAPEX and <$0.20/sq. ft. OPEX range
Application Centric: Instead of network connectivity, Private 5G’s role is connectivity to edge apps

Open Source

We’ve found that many of the components required to build Private 5G Networks can be found in open source communities. Here is a sample of the communities we work with:

Linux Foundation Networking (LFN) Nephio with OpenTofu and Ansible for orchestration
CNCF style monitoring (logs, metrics, alarms, tracing etc.)
Kafka & CNCF OPA for closed loop automation
LF AI&Data Janusgraph for inventory

Private 5G Orchestration

Private 5G is not without its challenges. Orchestrating various components like RAN, 5G core, and edge applications requires a comprehensive management strategy. Balancing performance with cost, and integrating multiple technologies into a unified system, are also key concerns. To address these challenges, Aarna.ml offers AMCOP and AES.

AMCOP simplifies the orchestration of 5G network components and enhances lifecycle management and automation, aligning with the need for simplicity and reduced manual input. AES, a streamlined SaaS platform, reduces operational complexities and integrates with public cloud services, contributing to cost-effectiveness and supporting an application-focused approach. Together, AMCOP and AES offer robust solutions to make Private 5G networks manageable and financially viable.

Get In Touch

Are you experiencing challenges pulling together a Private 5G Network? Aarna.ml expert team is here to help. Request a free consultation to discuss how to create value with Private 5G for your specific requirements and use cases or request a Free Trial of AMCOP today.

‍

Resources

resources

Blog

A Unique RAGOps Opportunity for NSPs to Offer RAG to their Enterprise Customers

‍

Enter Network Service Providers (NSP)

Getting Started with RAGOps

Next Steps

Blog recap by Yogendra Pal and Sriram Rupanagunta

Role-Based Access Control (RBAC)

O1 Functions and NACM

OAuth 2.0 for Access Management

Keycloak for Authentication and Authorization

NETCONF Access Control Model (NACM)

Understanding Private 5G Networks

Key Requirements for Private 5G Success

Open Source

Private 5G Orchestration

Get In Touch

^{Blog recap by Yogendra Pal and Sriram Rupanagunta}