Description
Event: LF AI & Data Day - ONNX Community Meetings, October 21, 2021
Talk Title: Model Zoo / Tutorials SIG Update
Co-Chairs: Wenbing Li & Mark Hamilton (Microsoft)
In the last meetup we mentioned that quantization and mobile models are our focus areas for the next milestone. Thanks to contributions from the Intel engineers, we now have several quantized models uploaded to the Model Zoo, such as ResNet-50, VGG, and ShuffleNet, and this is particularly useful for mobile apps. As you can see in the table, taking ResNet-50 as an example, without much loss in accuracy, the footprint and the latency are both greatly improved. Those models are generated by the Intel Neural Compressor, and they can be inferenced by ONNX Runtime without any extra configuration.
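Much of the footprint improvement follows directly from the narrower data type: int8 weights take a quarter of the space of fp32 weights. A back-of-envelope sketch (the ResNet-50 parameter count is approximate, and this ignores graph structure and non-weight overhead):

```python
# Back-of-envelope estimate of model footprint before and after int8
# quantization. The ResNet-50 parameter count is approximate.
RESNET50_PARAMS = 25_600_000  # ~25.6M parameters

def weight_footprint_mb(num_params: int, bytes_per_param: int) -> float:
    """Size of the weight tensors alone, in megabytes."""
    return num_params * bytes_per_param / 1e6

fp32_mb = weight_footprint_mb(RESNET50_PARAMS, 4)  # float32: 4 bytes each
int8_mb = weight_footprint_mb(RESNET50_PARAMS, 1)  # int8: 1 byte each

print(f"fp32: ~{fp32_mb:.0f} MB, int8: ~{int8_mb:.0f} MB")
```

The reported Model Zoo numbers also reflect operator fusion and other optimizations, so the real ratio varies per model; the 4x weight shrink is just the data-type floor.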
Also, we have a new kind of model, which we call the out-of-the-box model. This model folds all of the pre- and post-processing into the model itself, so when you do inference you don't need any extra steps for pre- and post-processing.
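As an illustration of the folding idea (this is a hand-made one-dimensional sketch, not the actual Model Zoo tooling): a per-channel input normalization (x - mean) / std followed by a linear layer w * x + b can be rewritten as a single linear layer with adjusted weights, so the caller no longer performs the normalization step at all.

```python
# Minimal sketch: fold the normalization step (x - mean) / std into a
# linear layer y = w * x + b, so callers can feed raw inputs directly.
# Real tooling applies the equivalent transform inside an ONNX graph.

def fold_normalization(w: float, b: float, mean: float, std: float):
    """Return (w2, b2) such that w2 * x + b2 == w * (x - mean) / std + b."""
    w_folded = w / std
    b_folded = b - w * mean / std
    return w_folded, b_folded

w, b, mean, std = 2.0, 1.0, 0.5, 0.25
w2, b2 = fold_normalization(w, b, mean, std)

x = 3.0
original = w * (x - mean) / std + b  # normalize, then apply the layer
folded = w2 * x + b2                 # single layer on the raw input
print(original, folded)
```

The same reasoning extends to tokenization or image resizing steps, which are expressed as extra operators in the graph rather than weight rewrites.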
This greatly simplifies model inference, and it is more portable, because you don't need to worry about those pre-processing libraries on any platform; it is especially useful for mobile apps. In this milestone we have also checked in a GPT-2 model. If you are familiar with GPT-2, you always require some generation algorithm, typically beam search, to get the final decoded result.
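Beam search itself is simple to sketch. The toy scoring function below stands in for the model's next-token log-probabilities; the three-token vocabulary and its probabilities are made up for illustration, not taken from GPT-2:

```python
import math

# Toy beam search over a hand-made next-token distribution. The scorer
# is a stand-in for a language model such as GPT-2, which would return
# log-probabilities over its whole vocabulary.
def next_token_logprobs(prefix):
    # Hypothetical model: slightly prefers "b", and prefers to stop
    # once the sequence has three tokens.
    if len(prefix) >= 3:
        probs = {"a": 0.1, "b": 0.1, "</s>": 0.8}
    else:
        probs = {"a": 0.3, "b": 0.5, "</s>": 0.2}
    return {tok: math.log(p) for tok, p in probs.items()}

def beam_search(beam_width=2, max_len=5):
    beams = [([], 0.0)]  # (token sequence, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, lp in next_token_logprobs(seq).items():
                candidates.append((seq + [tok], score + lp))
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for seq, score in candidates[:beam_width]:
            (finished if seq[-1] == "</s>" else beams).append((seq, score))
        if not beams:  # every surviving hypothesis has ended
            break
    finished.extend(beams)
    return max(finished, key=lambda c: c[1])[0]

print(beam_search())
```

Keeping the top `beam_width` hypotheses each step is what distinguishes this from greedy decoding, which would commit to the single best token at every position.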
Then let me talk a little bit about the roadmap for the next half year. Firstly, we always welcome model contributions, especially state-of-the-art models and models for mobile apps, because mobile becomes more and more popular now.
Another topic we have had a long discussion about is how to deal with the old opset versions in the models, because the Model Zoo was founded several years ago, and at that time the opset version was still very low compared to the latest.
We hope that this will enable people to build flexible applications on top of collections of ONNX pre-trained models that wouldn't be possible without a unified API to connect them all. And finally, one of the goals of this work is to maintain parity with other ecosystems such as PyTorch and TensorFlow, which already contain model hub abstractions.
We wanted to support multilingual clients, because ONNX is not just an ecosystem in Python but in a variety of different languages. This drove us to investigate a language-agnostic protocol layer called the manifest. It should support user-hosted and privately hosted hubs, so that anyone can deploy their own model hub and use it within their closed infrastructure but still be able to leverage the common APIs.
It should be secure and efficient: it should stop man-in-the-middle attacks with checksums, and it should support local caching so that subsequent calls to the APIs don't require re-downloading the model. And finally, we wanted to make this easy for the ONNX team to maintain.
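A minimal sketch of that caching-plus-checksum pattern (the `load_model` helper, its injected `fetch` callable, and the cache file naming are hypothetical illustrations, not the actual onnx.hub internals):

```python
import hashlib
from pathlib import Path

# Hypothetical sketch of a model-hub download helper: verify a SHA-256
# checksum from the manifest to defeat man-in-the-middle tampering, and
# cache the verified bytes locally so repeated calls skip the download.

def load_model(name: str, sha256: str, fetch, cache_dir: Path) -> bytes:
    cache_dir.mkdir(parents=True, exist_ok=True)
    cached = cache_dir / f"{name}.onnx"
    if cached.exists():
        data = cached.read_bytes()  # cache hit: no network round-trip
    else:
        data = fetch(name)          # cache miss: download the model
        cached.write_bytes(data)
    if hashlib.sha256(data).hexdigest() != sha256:
        raise ValueError(f"checksum mismatch for {name}")
    return data
```

Verifying the hash even on cache hits also guards against a corrupted or tampered local cache, at the cost of re-hashing the file on every load.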
So we wanted to generate this manifest from the existing ONNX models repository, so that this could be done automatically and this collection of models can be curated without any human input.
So, to quickly sketch out the main steps of the ONNX Model Hub: it all starts with the ONNX Hub Python client, which is built into the main onnx repository. Users install that on their machines and then can type onnx.hub.load and pass in the name of a model along with additional parameters. By default, this will point to the main ONNX models repository, where all these different pre-trained models are currently hosted, and in particular it will point to a manifest file that's stored in that ONNX models repository.
If you want a particular version, you can pass in the opset keyword, and if you want to go even further and pin the entire thing to a particular fixed hash of the ONNX models repository, say for reproducibility, that's possible as well. You can also download from user repositories, enabling private or custom-style deployments. In addition to downloading models, we also provide some utility functions for, say, setting the cache directory, for inspecting the information of these models prior to downloading, and even for querying the models by semantic tags like "vision" or "detection".
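The tag query can be pictured as a simple filter over manifest entries. The entries and field names below are made up for illustration; the real manifest lives in the ONNX models repository and carries more metadata per model:

```python
# Illustrative sketch of querying hub models by semantic tags.
MANIFEST = [
    {"model": "resnet50", "opset": 12, "tags": ["vision", "classification"]},
    {"model": "ssd", "opset": 12, "tags": ["vision", "detection"]},
    {"model": "gpt-2", "opset": 10, "tags": ["text", "generation"]},
]

def list_models(tags=None):
    """Return model names whose entries carry every requested tag."""
    if not tags:
        return [e["model"] for e in MANIFEST]
    wanted = set(tags)
    return [e["model"] for e in MANIFEST if wanted <= set(e["tags"])]

print(list_models(tags=["vision"]))     # vision models
print(list_models(tags=["detection"]))  # detection models only
```

Because the manifest is plain structured data, the same filter is easy to reimplement in any client language, which is the point of keeping the protocol layer language-agnostic.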
A lot of the functionality of the hub relies on a manifest, which contains metadata and locations of trained models. In particular, this manifest is not hard to create: we were able to automatically generate it using the existing markdown tables in the ONNX models repository, and this allowed us to pull in over 120 different models into our initial version of the hub, with metadata for checksumming, for inspecting the input/output structure of these models, and for a variety of other features in this work.
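That generation step can be pictured as parsing the repository's markdown tables into manifest records. The table columns, example rows, and output record shape here are simplified assumptions, not the exact pipeline:

```python
# Simplified sketch of generating manifest entries from a markdown
# table like those in the ONNX models repository README files.
TABLE = """\
| Model | Download | Opset |
|-------|----------|-------|
| ResNet-50 | vision/classification/resnet/model/resnet50-v1-12.onnx | 12 |
| MobileNet | vision/classification/mobilenet/model/mobilenetv2-12.onnx | 12 |
"""

def parse_table(markdown: str):
    rows = [line for line in markdown.splitlines() if line.startswith("|")]
    header = [c.strip().lower() for c in rows[0].strip("|").split("|")]
    entries = []
    for row in rows[2:]:  # skip the header and the |---| separator row
        cells = [c.strip() for c in row.strip("|").split("|")]
        record = dict(zip(header, cells))
        record["opset"] = int(record["opset"])
        entries.append(record)
    return entries

manifest = parse_table(TABLE)
print(manifest[0]["model"], manifest[0]["opset"])
```

Running this as part of CI is what lets the model collection stay curated without manual manifest edits: whenever a table row changes, the regenerated manifest picks it up.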
One of our big next steps is to create a JVM-based client for the ONNX Model Hub. We have currently implemented a Python client, but the JVM-based client will allow us to make a really nice interface for our new Apache Spark based distributed ONNX inference code, which allows you to evaluate deep learning models within Apache Spark clusters very easily and simply. And second, we would love to incorporate some responsible AI information into the ontology and the metrics.
If you have any feedback about the project, or want to continue moving it forward, please reach out. And thanks, as always, to the ONNX Steering Committee and everyone who helped along the way to provide us feedback, help us navigate the build system, and help us get this contribution into the main ONNX repository. So thank you all.