From YouTube: Keynote: Sailing with K8s Armada: Multi-Cluster Management with Massive Amounts of Nodes– Yifan Shen
Description
Don’t miss out! Join us at our next event: KubeCon + CloudNativeCon Europe 2022 in Valencia, Spain from May 17-20. Learn more at https://kubecon.io The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Sailing with K8s Armada: Multi-Cluster Management with Massive Amounts of Nodes – Yifan Shen, PaaS Cloud Platform Architect, ICBC & Kevin Wang, Lead of Cloud Native Open Source Team, Huawei
Hello everyone, I'm Kevin Wang, leader of the Huawei cloud native open source team. I'm so glad to be here to share with you. I'll start by introducing our speakers. As for me, I focus on developing open source projects and maintaining the community. I'm a CNCF ambassador and the co-founder of several CNCF projects, including Karmada.
First, let me start from the construction of ICBC's cloud platform. ICBC cloud services support the running of many applications, including online activities on holidays, core service applications, technical support applications such as MySQL and Redis, and applications in blockchain, AI, and other new technical fields.
ICBC has been developing a customized cloud platform based on mainstream open source projects, ensuring overall independence and controllability. It is also the largest container cloud in the industry, with more than 280,000 containers deployed. ICBC services run with specific requirements on a large-scale, cloud-native infrastructure.
There are four typical requirements, namely high-availability deployment, cross-cluster auto-scaling, cross-cluster scheduling, and specific dependencies on Kubernetes versions. Here's how ICBC's cloud-native infrastructure is running now.
First, limited availability: a Kubernetes cluster is also a fault domain, so automatic recovery across fault domains is required. Second, limited resources: application scheduling and auto-scaling are available only within single clusters. Third, non-transparent clusters: heterogeneous resources, fault domains, and other attributes are configured per cluster, so the service team has to distinguish all the underlying clusters to find the target cluster where they want to run services.
Upper-layer applications are not aware of cluster differences. Fourth, duplicate configuration: although our services are configured on the cloud management platform in a unified manner, the specific configuration needs to be delivered to each cluster, and it must be kept synchronized. To address these challenges, we formulated the following objectives for multi-cluster management.
For multi-cluster management, we want to manage all clusters and their entire life cycle in one place, and use unified, standard APIs for resource management. We need to support multiple versions of comprehensive Kubernetes resources and multi-dimensional resource overrides. Cross-cluster auto-scheduling should be realized based on fault domains and resource margins, partnering with cross-cluster auto-scaling. For disaster recovery, cross-cluster resources should be able to auto-recover, and the control plane and service clusters should be decoupled for compatibility.
First, commercial products bring in vendor lock-in, which does not meet our requirements on independence and controllability, so other open source software, such as KubeFed, can be a consideration. According to our survey of KubeFed, it uses non-native Kubernetes APIs, which makes it difficult to migrate existing clusters. In addition, its community is becoming less active, and it hasn't become the de facto implementation standard in the industry. So we decided to develop Karmada to satisfy our needs, as we are building the ICBC financial cloud based on open source software.
Some of you may wonder: why didn't we choose to go further with KubeFed, but instead initiated a new project? Actually, we did develop a Kubernetes Federation version 3 at the early stage and quickly completed the prototype development. However, after communicating with many community users, we found that federation itself cannot fully satisfy what's expected.
In the core architecture development of Karmada, we have learned from the experience of multiple co-sponsors in multi-cloud and multi-cluster management. We focused on native Kubernetes API support and the scalability of the core architecture. The architecture of Karmada is similar to that of a single Kubernetes cluster in many ways: both of them have a control plane, an API server, a scheduler, and a group of controllers.
In addition, the scheduler framework has a pluggable design, so users can customize and extend scheduling policies. In terms of member cluster synchronization, Karmada implements an agent-based pull mode. In this way, the working pressure on the control plane can be effectively reduced, and users can easily manage large-scale multi-cluster resource pools.
In this way, users can use the YAML manifests or APIs from the original single cluster to create multi-cluster applications without any modification. Service platforms developed on top of Kubernetes APIs do not need to be modified either; they can be directly transformed from a single-cluster architecture to a multi-cloud, multi-cluster architecture by interconnecting with Karmada.
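As an illustration of this point, the manifest below is a completely ordinary single-cluster Deployment; nothing Karmada-specific appears in the application definition, and the same YAML can be submitted to the Karmada control plane unchanged. This is a minimal sketch, and the `nginx` workload name and image tag are hypothetical examples, not taken from the talk:

```yaml
# A standard apps/v1 Deployment, exactly as it would be written
# for a single Kubernetes cluster. Submitting it to the Karmada
# API server instead of a cluster's API server requires no edits.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx
  labels:
    app: nginx
spec:
  replicas: 3
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
        - name: nginx
          image: nginx:1.21
```

It would typically be applied with `kubectl apply -f deployment.yaml`, pointing the kubeconfig at the Karmada control plane rather than a member cluster.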
If an application has a special label and its mode is multi-zone replication, the application is strictly propagated to the three zones. The YAML manifest on the right is the API definition of a standard Deployment.
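In Karmada, this kind of multi-zone placement is typically expressed as a separate propagation policy rather than by changing the Deployment itself. The following is a minimal sketch assuming Karmada's `policy.karmada.io/v1alpha1` API and a hypothetical Deployment named `nginx`; exact fields may vary across Karmada versions:

```yaml
# Hypothetical PropagationPolicy: propagate the nginx Deployment
# so that its replicas are spread across exactly three zones,
# matching the "multi-zone replication" mode described above.
apiVersion: policy.karmada.io/v1alpha1
kind: PropagationPolicy
metadata:
  name: nginx-multi-zone
spec:
  resourceSelectors:
    - apiVersion: apps/v1
      kind: Deployment
      name: nginx
  placement:
    spreadConstraints:
      - spreadByField: zone   # group candidate clusters by zone
        minGroups: 3          # require at least three zones
        maxGroups: 3          # and no more than three
```

Keeping placement in a policy object, separate from the workload, is what lets the Deployment YAML stay identical to its single-cluster form.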
During Karmada's positive cycle of design, R&D, practice, and redesign, ICBC has summarized its advantages and lessons learned in the following four aspects: resource scheduling, disaster recovery, cluster management, and resource management. I think the following three points deserve special attention and are especially prominent in the actual implementation process.
After talking about the current status of our project, let's take a look at the follow-up plans. In terms of large-scale production, we hope that the container cloud platform will serve as a user-oriented platform, and that the underlying layer will manage and schedule the resources of multiple clusters in a unified manner based on Karmada. In this way, more than 100 existing Kubernetes clusters, including heterogeneous clusters, can be managed. We also hope to make continuous contributions to the community.
Of course, the Karmada project also has many other exciting functions. You are welcome to visit the Karmada project on GitHub to view our release notes and community documents and learn more about the functions and details. If you have any suggestions or feedback when using Karmada, please join our WeChat group and Slack channel to reach out to us. That's all our sharing for today. Thank you for your time.