From YouTube: DevoWorm (2020, Meeting 42): Input Data, Physics of Cell Temporality, Computational Bio Education
Description
An update to the DevoWormML lecture on Input Data, the "Periodicity in the Embryo" manuscript, a paper on the physics of cell aggregates, and computational biology education. Attendees: Susan Crawford-Young, Krishna Katyal, Mayukh Deb, Mainak Deb, Jesse Parent, Debojyoti Chakraborty, and Bradly Alicea.
B: Not too bad. I'm doing a carpentry project this week, so there was yesterday, and I finished my course; both.
B: I had a couple of articles about mechanics. Well, I just have to find them; yeah, let's see.
A: I'm doing okay, getting ready for the holiday season, and, yeah, sort of getting to the end of the year, getting everything in order, you know, reflecting on the year. I guess that's a good exercise to do.
B: Oh, okay: well, I'm just doing panicked carpentry projects.
A: Well, anyways, yeah: we can get started on some things. I know people will be watching on YouTube, as some people can't make it to the regular meeting, so you'll be able to follow up on some of these things, either in the group Slack or via email or some other means.
A: This one's on input data. Last week was the NeurIPS conference, so for people who are interested in machine learning and other things like that, that was a major conference in the field. So we had a discussion; I had a discussion with Jesse Parent on Saturday, recapping what was going on at that conference. It was a pretty long discussion; if you're interested, I can send you a link to those materials. But the input data, I think, will be interesting to both the machine learning people and people doing empirical science as well, like collecting data, because it does give a broader view of data analysis and things like that. The periodicity of the embryo paper is something that I am working on, trying to get out by the end of the year; we're going to look at that briefly, and then we have some papers that we'll talk about.
A: There, yeah, you're audible; okay, yeah. So we were just talking about: I'm going to do a lecture on input data, which is something that I did a lecture on during DevoWormML, but I'm updating it for 2020; then an update on one of the papers that we're working on; and then papers.
A: So let me start. Let me share my screen.
A
Okay,
so
this
is
the
input
data
lecture
here,
and
so
this
was
something
that
again
was
presented
first
in
2019
and
now
we're
updating
it.
So
this
is
input
data
or
the
idea
of
what
comes
what
go.
What
comes
out?
What
goes
in
so
you
know
you've
heard
of
the
expression
garbage
in
garbage
out
and
that's
you
know
if
your
data
is
bad,
your
model
output
will
be
bad
as
well.
That
applies
to
statistics
as
well
as
machine
learning.
A
This
is
a
picture
of
a
system
where
you
have
data
flowing
between
variables,
and
then
I've
stated
this
is
a
bayesian
proposition.
If
you
didn't
catch
that
reference,
so
this
is,
I
updated
the
devo
zoo
part.
The
first
part
I
would
mention
would
be
that
we
have
a
reference
for
input
data
for
our
models.
It's
called
devozu,
and
this
is
something
that
usually
has
worked
on
last
year
to
update
and
I've
worked
on
it
with
them,
and
we've
made
the
input
data
that
we
have
in
our
group
more
accessible
to
collaborators.
A: Well, wonderful, yes, welcome. We're just starting a lecture on input data, so why don't we...
A: Where I had it, it was about the second slide, so that's okay. So, to recap: this is input data. This is the idea that you have to put in good data to get a good result out of your model. This holds true for statistics as well as machine learning, or deep learning, or any other kind of model you want to build. And so we have...
A: This is an update; we had a course in 2019 called DevoWormML, and I'm updating these lectures, so this is where this is coming from. So we have this resource called DevoZoo; again, this is where you have a bunch of data sets that we've curated from different sources: a lot of cell tracking data, a lot of lineage tree data and differentiation tree data, and gene expression data.
A: But there are a lot of instances of open biological data. There are a lot of sources; ImageJ, for example, has a series of public data sets that you can download. These are data sets that have been captured through various microscopy techniques, so they have, you know, bright-field microscopy...
A: You know, antibody stains, fluorescent images; so you can get all sorts of images and do a lot of processing and post-processing on them. They have data sets that are raw data sets of numbers, and they have data sets with images, and I think in these meetings you've seen a variety of that kind of data. And of course, you might use raw image data for one thing, raw numeric data for another thing, and combine them; I'm not going to get too much into that process.
A: But I'm going to talk a little bit about some of the caveats to watch out for. There are also data sets like the avian vocalization data set here that I'm highlighting; this is someone's work that they put up on Kaggle. So it's a Kaggle data set, and it's just avian vocalizations from California and Nevada.
A: So you can, you know, combine them, maybe with avian vocalizations from other places, and you can use it to train a machine learning model.
A: So if you have a bunch of microscopy images, for example: is it quantifiable? Can I turn these images into numbers, and can I do it reliably? The other criterion is: is it enough data? Is there enough data to run my model? In machine learning, that's a very big constraint, because you need a lot of data to train your model; but even in statistical analysis, you have to have a large enough n to make inferences about things. So you always have to keep those in mind.
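As a minimal sketch of what "turning images into numbers" can look like, assuming NumPy and Pillow are available (the file name and the particular features are hypothetical choices, not from the talk):

```python
import numpy as np
from PIL import Image

# Load one microscopy image (hypothetical path) and convert to grayscale.
img = np.asarray(Image.open("embryo_frame_000.png").convert("L"), dtype=float)

# Two simple ways to quantify it:
pixel_vector = img.flatten() / 255.0           # raw pixel vector for a model
summary = {
    "mean_intensity": img.mean(),              # overall brightness
    "std_intensity": img.std(),                # contrast proxy
    "bright_fraction": (img > 128).mean(),     # fraction of bright pixels
}
print(summary)
```

Whether features like these are extracted reliably across images is exactly the veracity question raised later in the talk.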
A: It doesn't mean that, because you have a huge data set, it's really informative data; there's no necessary correspondence between the two things. It's just that you want to make sure you have enough data for your particular problem, but that's always an important issue. Another issue is velocity.
A
How
much
change
over
time
in
terms
of
movement
or
process?
Does
this
capture
so,
for
example,
if
you're
looking
at
like
a
person
running
a
marathon,
you
don't
want
to
just
have
a
little
bit
of
data
of
them
coming
off
of
the
blocks,
or
I
guess
you
know,
if
they're
running
sprints,
they
come
off
of
the
blocks,
but
when
they're
just
starting
out
and
then
that's
all
the
data,
you
have
that's
not
enough
data
to
capture
that
entire
process.
A
We
see
this
a
lot
with
embryos
where
you
have
a
little
bit
of
data
at
the
beginning.
You
might
analyze
it,
but
it's
harder
to
say
things
about.
What's
going
on
later
in
the
process
from
those
data,
so
you
always
have
to
keep
that
in
mind
that
if
you
want
to
capture
an
entire
process,
you
need
to
have
representative
data
set
and
there's
a
lot
of
velocity
really
refers
to
the
amount
of
change
that
goes
on
during
that
process.
So
there's
a
lot
of
change
that
goes
on
during
that
process,
you're
going
to
miss
it.
A: ...if you only have a little bit of data. Then there's variety, which is how much natural variation is captured by your data. So if you have, again, a little bit of data representing... oh, there's Mainak, hello.
A: I might as well check the chat now, if there's anything in the chat; I don't think there is. Okay, so back to the presentation.
F: Actually, like, I know the importance of the data. You know, I needed data on the corona situation, like how many people were affected with corona, some similar kind of thing, and some COPD data. But there was a lack of data, so I could not train on it.
F: I could not train the model with the data, and I needed to create a dummy data set for that, which is pretty bad, and the model was overfitting.
A: Right, yeah; we're going to talk a little bit about what we call pseudo data later on, and it'll be kind of interesting to see. Morph and noise, yeah. There's Mayukh; hello, Mayukh. So let's see; so again, we have variety. If your data set isn't representative of all the variety in the world, or enough of it to make it representative of the world, then that can be a problem. And then veracity: is your data reliable before and after transformation?
A
If
you
have
data,
that
is
you
know,
it's
not
really
something
that
looks
like
what's
going
on
in
the
world.
It's
not
going
to
be
useful,
and
so
all
those
things
again,
you
know
they're
very
useful
in
machine
learning,
but
also
in
empirical
statistical
analysis
as
well.
A
So
how
do
we
know
if
our
open
data
set
is
usable,
and
so
these
are
the
criterion
that
they
present?
The
literature
you
know
has
to
be
available.
It
has
to
be
what
they
call
usable,
which
means
it
has
to
be
documented
as
to
have
metadata
has
to
be
credible,
has
to
be
reliable.
So
you
know
you
have
to
know
that
it's
that
it's
it
represents
what's
going
on
in
the
world.
A
It
has
to
be
relevant,
so
it
has
to
be
like
relevant
to
the
problem
and
then
presentation,
quality
has
to
be
readable
and
the
structure
has
to
be
there.
So
that's
one
of
the
reasons
why
we
do
a
lot
of
annotation.
We
do
a
lot
of
processing
of
secondary
data
is
to
make
it
you
know,
presentable
and
readable
to
people.
There
are
a
lot
of
times
if
you
go
to
a
repository.
A: ...they'll have standards for submitting your data. If you create data and you submit it to a repository, there are a number of standards you have to follow. Sometimes it might be a file structure that you have to use; other times it might be different pieces of metadata that need to be there.
A: You plug it into a model, and then you make a prediction. Of course, it's not as straightforward as it seems: you can have training data that isn't very good, and it forces your model to underfit or overfit, so you have to go back and refine the training data. The training data is an integral part of this. And so we have the MNIST database, which is a famous benchmark data set, and this allows for model training.
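As a minimal sketch of that "benchmark data in, trained model out" loop; this uses scikit-learn's small bundled digits set as a stand-in for full MNIST, purely for illustration:

```python
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# A small MNIST-like benchmark: 8x8 grayscale images of handwritten digits.
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = LogisticRegression(max_iter=1000)   # simple baseline classifier
model.fit(X_train, y_train)
print("held-out accuracy:", model.score(X_test, y_test))
```

The held-out split matters here: accuracy on the training data alone would hide exactly the overfitting problem described above.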
A: This is the Stanford Cars data set, and then this is the iris data set. I'm not showing the full extent of the variation of the iris data set, but it's a series of these plants, these flowers. You can use these image data, decompose the data into features, and then figure out what the shape should look like, what the features of the geometry are, and so forth.
A: Now, you look at the cars data set and you look at celebrity faces, and you see some of the four V's there. You see that you have variety, of course. You might have velocity in the cars data set, where you may have cars over many years, the different designs of cars. So if I show you a car from 1980 and I show you a car from 2020, those two cars might look stylistically different, but can you say that they're both cars?
A: Well, we usually say they're both cars. Hopefully, if you've trained your model on both types of cars, it can also recognize that those are both cars. So those are the kinds of challenges you have in some of these input data sets. The data sets should have a lot of variation...
A: They should have some time variation as well, and they should also be representative of the phenomena you're trying to measure. So those are some of the things you want to consider when picking an input data set. And we had another thing in the chat, I believe. Okay, yeah: so Mainak has something interesting to show regarding pseudo data. So yeah, when we get to the pseudo data part, I'll let you present that. So, that's this.
A: This has to do with data sets for training a machine learning model, and of course, if you're doing a statistical analysis, it's a similar issue. If you want to analyze something, you want to have those four V's; you want to take those into account when you make the analysis. And of course, if you do a standard statistical... well, machine learning is a statistical analysis.
A: But if you step back and do something like some sort of classification analysis, or even an ANOVA, these same things apply: you want to have a lot of variety, you want to have a large sample, and you want to look at things.
A: But, you know, it's a very hard problem, because it's very computationally intensive and there's a lot of variation here. You have a lot of parts of the protein, and they fold at different rates, and you have different things going on. So it's a very hard problem, and they've been having contests on this for years.
A: They've usually done this by using supercomputers, using physical models, very large physical models with force fields and things like that, where they've been able to model maybe a small portion of a protein's life. They can model folding, but it's not...
A: ...a perfect simulation of the process. What they've been able to do here with AlphaFold is increase their performance over previous years: they've been able to take what they've done with supercomputers, force fields, and other types of algorithms that they've used for this problem, apply deep learning to it, and get this performance increase.
A: But we have to understand that this progress comes from years of open data, open data input, and open data investment. So here is someone, Aled Edwards, who says: awesome, but remember this wasn't by chance; structural biologists did a couple of things. First of all, they set up a data hub 50 years ago, called the PDB, which contains a lot of the protein structures. They do X-ray crystallography on the proteins...
A: ...so they get a sort of image of the proteins and their structure, and then they have the data uploaded to this repository, where you have digitized versions of that data. Then, too, they've created a culture of data sharing: everyone who images a protein will upload it to this database, so it's available to the community. This is how they've been able to get protein data...
A: ...that's diverse, and really kind of understand what proteins are doing. Then they focused on data quality, so the PDB hub enforces strict quality standards for input data. Then they ran competitions to benchmark prediction methods, which are the competitions you saw earlier, which means it's forcing people to innovate and forcing people to adhere to community standards. And then, fifth, they contributed over 10,000 new structures to this exact end. So this wasn't...
A: Yeah, AlphaFold 2 is really big right now. So yeah, I think that's a good exit point, but it's also a good example of input data and how it's really enabling a lot of these kinds of advances. So: these are pre-trained models. I made this slide last year, and I know we've done a lot of work on pre-trained models in the group since then.
A: So we have this deep learning model zoo of ImageNet, ResNet, Inception, and VGG, and here are some of the references. And then, to talk about pre-trained models: they also allow for clearly defined features and classes to be built from data of specific types, you know, scenes or faces or cars or tomatoes. There's an article highlighting 10 pre-trained models and data sets.
A: This is from a data analytics blog. And then, overall, pre-trained models also allow for faster training, often with superior performance; but this may not always be the case, so we should be careful, and in this article they tell you to approach pre-trained models with caution. It's not all wonderful stuff in pre-trained model land; there are some caveats that you have to consider, and a lot of those have to do with input data and training. So pre-trained models use input...
A: ...in the group with embryos. Examples are where you take, say, a dog, and you take this image and you stretch it, and you shift it to different orientations, and you train your model, your pre-trained model or your deep learning model, on those variants.
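A minimal sketch of that kind of augmentation, using Pillow; the particular transformations here are illustrative choices, not the specific ones used in the group's pipelines:

```python
from PIL import Image

def augment(img):
    """Return simple mirrored/rotated/stretched/shifted variants of one image."""
    w, h = img.size
    return [
        img.transpose(Image.FLIP_LEFT_RIGHT),      # mirror
        img.rotate(15, expand=True),               # small rotation
        img.resize((int(w * 1.2), h)),             # horizontal stretch
        img.crop((10, 10, w, h)).resize((w, h)),   # shift via crop-and-resize
    ]

# Each variant keeps the label of the original image, so a data set of N
# labeled images becomes roughly 5N training examples.
```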
A: You also have to take into consideration balancing your training set, or your input data set, across different classes, so that each class is represented by a similar number of samples. You might downsample, you know, to match the resolution of other sets; or you might upsample, say, if you have one class where you only have a couple of instances but other classes where you have a lot of instances. You can use these sorts of strategies to normalize for that.
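A minimal sketch of up- and down-sampling to balance classes, assuming the samples are already grouped by label (the class names and counts here are made up):

```python
import numpy as np

rng = np.random.default_rng(0)

def balance(samples_by_class, target):
    """Resample every class to exactly `target` examples."""
    balanced = {}
    for label, samples in samples_by_class.items():
        # Upsample (with replacement) if the class is small,
        # downsample (without replacement) if it is large.
        idx = rng.choice(len(samples), size=target,
                         replace=len(samples) < target)
        balanced[label] = [samples[i] for i in idx]
    return balanced

data = {"dividing": list(range(500)), "non_dividing": list(range(20))}
print({k: len(v) for k, v in balance(data, target=100).items()})
# {'dividing': 100, 'non_dividing': 100}
```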
A: So this is one thing you have to keep in mind when you have input data: it's not always neat and even in terms of the classes you're building, especially when you put data sets together from different places. You might have some classes that are overrepresented and some that are underrepresented, so these are strategies to normalize all of that. So now we're getting into synthetic and pseudo data, and this is something that might not get mentioned...
A: He had an example of this. So: synthetic or pseudo data is a model of data that generates something not found in the original measurement, or something that's not directly measurable. An example would be interpolation between samples, or dynamical modeling of a hypothetical regulatory process. So, in systems biology we'll often use...
A: ...you know, often have some model that we use for regulation, but we only have some of the genes measured. So, to get at some other things, we might create some stochastic process: we'll generate some data representing the stochastic process, simulate the data, and then maybe use some of the things we know about the intermediate processes to modify that stochastic simulation of the data. You can also do things like re-sampling real data, through jackknifing and bootstrapping.
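A minimal sketch of both resampling schemes on a handful of measurements; the numbers are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)
data = np.array([4.2, 5.1, 3.8, 4.9, 5.5])   # a few real measurements

# Bootstrap: resample with replacement to estimate the spread of a statistic.
boot_means = [rng.choice(data, size=len(data), replace=True).mean()
              for _ in range(1000)]
print("bootstrap standard error of the mean:", np.std(boot_means))

# Jackknife: recompute the statistic leaving one sample out each time.
jack_means = [np.delete(data, i).mean() for i in range(len(data))]
print("jackknife means:", np.round(jack_means, 2))
```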
A: You can also use small-sample inference, which is distribution modeling. You can take something you only have a couple of samples of and build a distribution. So you can say: well, I only have a couple of samples of this thing, but I assume it's coming from a Gaussian distribution, so I'll build a Gaussian distribution around the parameters that I measure from my few samples, then sample from that distribution and generate new points, and I can assume that this is the way the rest of the data is distributed.
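That recipe, fit parameters to the few samples you do have and then draw pseudo-samples from the fitted distribution, is a few lines of NumPy (the measurements are made up, and the Gaussian assumption is exactly that, an assumption):

```python
import numpy as np

rng = np.random.default_rng(2)
few_samples = np.array([12.1, 11.4, 13.0])    # only three real measurements

# Fit a Gaussian to the observed parameters...
mu, sigma = few_samples.mean(), few_samples.std(ddof=1)

# ...then generate pseudo data under that assumption.
pseudo = rng.normal(mu, sigma, size=200)
print(f"fitted mean={mu:.2f}, sd={sigma:.2f}; pseudo mean={pseudo.mean():.2f}")

# Caveat: these are not real measurements, and that must be made clear
# wherever the pseudo data are used.
```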
A: So it's a good heuristic for the data that I want to understand. And finally, you have things called data-dependent priors, so you can use Bayesian inference to generate priors for, say, a Bayesian model. This is where you observe data, you build a prior, and then you try to predict...
A
You
know
something
from
that
prior,
so
I'm
not
going
to
get
into
bayesian
inference,
but
that's
another
way
to
go.
So
how
do
you
make
a
pseudo
data
set?
So
the
first
step
is
to
create
archetypical
labels
or,
like
figure
out
what
you
want
to
create
in
terms
of
a
variable,
and
so
you
want.
What
do
you
want
to
measure?
And
so
you
want
to
create
these
and
be
clear
about
what
you're
trying
to
measure.
A
Then
you
use
a
distribution
or
a
statistical
distribution
to
create
a
series
of
plausible
values
based
on
estimates,
so
you
might
have
like
a
mean
and
a
standard
deviation,
and
from
that
you
can
send.
You
know
you
can
build
like
a
some
sort
of
distribution.
You
can
build
a
gaussian
distribution.
You
can
build
a.
A: ...exponential distribution of some type. But you're going to get a larger number of values; those values will represent that process, but they won't be the real measurements, so you're going to make that clear. And then you want to sample these values in ways that test hypotheses and allow for variation to be represented.
A: ...amenable to what you're actually trying to measure; but you also want to be able to test hypotheses with it, or plug it into a machine learning model. And, like we mentioned before, you can get underfitting and overfitting if you don't do this process correctly. So it's basically three steps. And here's an example of pseudo-labeling, where someone's doing semi-supervised learning, which is where you have a lot of unlabeled data but some labeled data, and you want to kind of augment your labeled data with it. So you use a process of semi-supervised learning to do this: you create labels from unlabeled data, essentially guessing based on features in the unlabeled data set, and then you can use a more formal set of labels to weed out false positives.
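A minimal pseudo-labeling sketch along those lines, using scikit-learn; the 200-sample labeled split and the 0.95 confidence threshold are arbitrary illustrative choices:

```python
import numpy as np
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression

X, y = load_digits(return_X_y=True)
labeled, unlabeled = slice(0, 200), slice(200, None)  # pretend most labels are missing

# 1. Train on the small labeled portion.
model = LogisticRegression(max_iter=1000).fit(X[labeled], y[labeled])

# 2. Guess labels for the unlabeled portion; keep only confident guesses.
probs = model.predict_proba(X[unlabeled])
guesses = model.classes_[probs.argmax(axis=1)]
confident = probs.max(axis=1) > 0.95

# 3. Retrain on the real labels plus the confident pseudo-labels.
X_aug = np.vstack([X[labeled], X[unlabeled][confident]])
y_aug = np.concatenate([y[labeled], guesses[confident]])
model = LogisticRegression(max_iter=1000).fit(X_aug, y_aug)
```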
D: ...to train models. So let's say you wanted to train a GAN to generate some digits. You know, if you train this GAN, you get outputs like this: these are not really digits. They have features like digits, but they are not really digits; some of them are trees or whatever.
D: ...digits. And here the discriminator is trying to sharpen its skills by training on the fake and the real samples, and trying to discriminate between the real and the fake examples. So what the discriminator does is: suppose the generator has generated this six, right; the discriminator will take the six as input, and then...
D: ...framework, right, yeah. So, like, you might be, you all might be...
F: Actually, so that's, yeah. But I think for creating some labeling data, for fake labeling data, if we can find some other easy ways to do it, I think that's a better way: to try with the easy one first, and after that we can go to the cancer data, something like this, because there is kind of a lot of, like...
F: And one question: like, each label is converted to an array when it is...
A: Yeah, post it in the chat as well, just the link. So yeah, I think that's a good thing for people to look over and, you know, try. So yeah, I like that; I didn't even think about those sorts of things. But yeah, there are a lot of ways to create pseudo data, and this is of course one, if you're doing this. What I'm showing you is sort of a very general thing, maybe more oriented towards statistical analysis, but yeah.
A: There are a lot of ways to generate pseudo data, and of course, the whole reason we're doing this is that we're trying to get more input data than we otherwise would have; we're trying to simulate the input data a little bit more, maybe augment it or do other things. So thank you, Mainak, for that; that's very helpful. And so the next thing I want to talk about: now we're getting into sort of some of these things about...
A
So
you
know
if
you
have
small
and
incomplete
data
sets,
and
sometimes
it's
not
so
much
that
your
data
set
is
incomplete,
but
it's
a
has
difficulty
interpreting
rare
events
so
like,
for
example,
if
there's
a
rare
event
in
the
world,
something
doesn't
happen
very
often.
If
you
train
your
data
set
on
a
lot
of
normalized
data.
A
You
know
it's
not
going
to
pick
it
up
so
like
in
the
mnist
example
say
there.
There
are
peop
certain
people
who
write
their
threes
backwards,
and
you
know
that's
a
three
but-
and
this
is
something
you
know
how
people
sometimes
cross
their
seven.
Sometimes
they
don't.
A
If
you
don't
train
it
on
that
ver
than
the
variant
and
it's
a
rare
variant
in
in
the
population,
but
it
exists,
then
it
won't
be
able
to
interpret
that
three.
So
that's
one
thing
to
to
consider
another
thing
is
the
ability,
the
machine
algorithm's
ability
to
learn
from
decisions
is
based
on
choice,
contingent
feedback.
A
Finally,
there's
something
called
biased,
inference
and
evaluation,
and
so
this
is
what
they
call
the
attentional
learning
trap.
That's
where
the
better
choice
is
ignored,
because
it's
not
obvious
due
to
saliency
or
recency
biases.
A
So,
in
other
words,
if
something
isn't
really
salient
or
something
isn't
recent,
then
it
can
be
ignored,
even
if
it's
a
better
choice,
and
so
you
know
all
these
things
we
have
to
consider,
they
call
it
natural
stupidity,
because
I
think
there's
been
an
assumption
that
machine
learning,
algorithms
are
sort
of
a
triumph
on
human
decision
making
and
some
of
the
errors
that
we,
you
know
from
memory
and
decision
making.
But
that's
not
exactly
the
case,
and
so
this
paper
is
kind
of
an
interesting
take
on
that.
A
But
to
think
more
about,
like
this
idea
of
rare
events,
there's
a
lot
of
like
literature
in
in
machine
learning
and
in
human
cognition.
That
kind
of
deals
with
this
one
of
them
is
the
out-of-distribution
generalization.
A: Usually, we train our models on distributions that are i.i.d., independent and identically distributed; events that are idealized, so they exist, maybe, in a Gaussian. But of course, Gaussians also have tails, and those tails aren't very well sampled; so if you have something that exists in a tail, you're going to misinterpret it. So what they've done is train models purposefully on these out-of-distribution events, and train them to generalize to those events as well.
A: Another thing is burstiness. So we think of periodic events, but things in the world aren't distributed periodically. In other words, we think of things as happening, you know, maybe every five units or ten units of time; but in the natural world, you have a lot of different types of distributions. Some of them are Poisson-distributed, some of them are bursty, and you can see the differences in how they occur. So if you train your model on a periodic set of events, you're going to miss a lot of this natural variation in how things happen. Not to get into the distributions here, but these are events that are clustered in time, and there's no meaning to the clustering; it's just the way the process operates. So that's something to think about as well.
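One common way to put a number on this, a sketch rather than anything from the slides, is the burstiness measure of Goh and Barabási, B = (σ − μ)/(σ + μ) computed over the inter-event intervals: B is near −1 for perfectly periodic trains, near 0 for Poisson trains, and positive for bursty ones.

```python
import numpy as np

rng = np.random.default_rng(3)

def burstiness(event_times):
    """B = (sigma - mu) / (sigma + mu) over inter-event intervals."""
    iv = np.diff(np.sort(event_times))
    return (iv.std() - iv.mean()) / (iv.std() + iv.mean())

periodic = np.arange(0, 1000, 5.0)                      # one event every 5 units
poisson = np.cumsum(rng.exponential(5.0, size=200))     # Poisson process
bursty = np.cumsum(rng.pareto(1.5, size=200) * 5.0)     # heavy-tailed gaps

for name, t in [("periodic", periodic), ("poisson", poisson), ("bursty", bursty)]:
    print(name, round(burstiness(t), 2))    # roughly -1, 0, and > 0
```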
A: Then, of course, rare events: there's something called extreme value theory, which you can look into if you're interested in rare events. This is more of a statistical aside, but I would encourage you to think more about these sorts of things.
A: So you're not trying to generalize with the machine; you're just trying to, you know, maybe extract features or find an interesting pattern, and then the human operator would verify that and say whether it's something interesting or not. And this is something that allows for easier selection, when users are selecting among things presented by algorithms that are mining data.
A
So
you
know
going
back
to
biology.
Consider
the
following:
spherical
systems.
Now
we
think
of
data,
we've
talked
about
the
mna
status
and
we've
talked
about
the
faces
data
set,
but
think
about
something
like
this.
So
there's
a
lot
of
input
data
here,
potentially
there's.
You
know
this
shape
data,
but
we
also
have
dynamical
data.
We
have
data
on
cell
divisions,
we
have
data
on
deformations
of
individual
cells,
we
have
movement
data,
we
have
a
lot
of
things
going
on
and
these
are
two
different
embryos,
and
these
are
just.
This
is
just
embryogenesis.
A
So
there's
a
lot
of
you
know
physical
interactions,
there's
a
lot
of
data
that
we
can
extract
from
this
and
uses
input
data
to
our
models,
and
so
there
are
a
couple
questions
for
application
specifically
to
developmental
biology
and
of
the
first
one
is:
what
does
a
grid
training
set?
Look
like
what
property
should
it
have
so
should
it
be,
you
know,
based
on
microscopy,
should
it
be
based
on
genetic.
You
know.
A
Data
should
be
based
on
electrophysiological
data,
our
pre-trained
models
adequate
and
we
kind
of
know
a
little
bit
about
that
from
our
work
last
summer.
They're
adequate,
but
you
know
we.
We
still
want
to
do
more
work
on
this.
Do
we
need
biologically
specific
models
or
multi-scale
models
and
then,
finally,
what
are
the
semantic
aspects
of
biological
data?
A
Are
they
just
labels
or
are
they
annotations
of
function
or
are
they
shapes
like
the
shapes?
I
showed
you
there
where
you
have
spheres-
or
you
know
blobs,
you
know,
are
movements
semantic
like
when
cells
migrate
or
functions
if
cells
have
a
different
function,
even
if
it's
just
kind
of
watching
it,
like
you
know,
contract
or
watching
it
move.
Is
that
a
semantic
thing?
And
so
those
are
all
interesting
questions,
and
then
I
would
finish
with
this.
This
is
what
we
have
now.
This
is
a
state
of
the
art
in
the
project.
Divo
learn.
A
This
is
where
we
have
a
pre-trained
model:
divalern
0.2.
This
is
optimized
to
segment
and
analyze
high-resolution
microscopy
images.
This
is
some
information
on
diva
learn,
but
we
also
have
a
platform
viva
learn,
which
is
this.
You
know
initiative
to
sort
of
bring
together
all
of
the
machine,
learning
tools
that
we
have
in
the
group
and
allow
people
to
learn
around
that
library
of
things.
So
we
have
secondary
data,
we
have
these
analytical
tools
and
then
we
have
an
educational
collection.
A
So
that's
all
I
have
on
that
and
I
I
will
probably
be
updating
it
again.
I
want
to
put
some
of
your
input
in
and
I'm
going
to
provide
authorship,
but
you
know
I
I
we
have
a
separate
stub
for
this
on
github,
so
you
know
I'll
be
I'll,
be
communicating
with
you
about
this
in
the
future.
I
think
this
is
a
good
topic
for
people
to
learn
about,
and
I
don't
really
see
it
too
often
in
other
areas
of
like
you
know.
A
We
just
had
nurips
last
week
and
I
don't
think
anyone
talked
too
much
about
input
data
at
the
conference,
but
it's
definitely
an
important
issue
to
talk
about.
A
So
I
wanted
to
present
on
a
couple
other
things
today
and
if
you
have
to
leave
at
the
top
of
the
hour,
that's
okay,
I'm
just
gonna,
keep
going
until
we
get
some
of
these
other
items
off
the
board.
Thank
you
for
your
attention
on
that.
That'll
be
useful
for
another
round
of
revisions
on
that.
The
first
thing
I
want
to
mention
is
this
periodicity
in
the
embryo
paper,
and
so
this
is
a
paper
we've
mentioned.
For
you
know
a
number
of
months
we've
been
talking
about
working
on
this.
A: It's basically this idea that in embryos you have these temporal features, as I'm calling them. It's basically one of the ideas I showed in the slides, of how things are distributed in time. So there's this idea of looking at cell divisions and how often they occur in the process of building an embryo, and it turns out that they're very bursty and have a lot of interesting properties; this paper is going to go through some of those properties. We use two different animal models.
A: We use C. elegans, the roundworm, which is what our group is named after; and then we have Danio rerio, the zebrafish, which is a model organism. It's a fish, obviously, and it has a different type of embryogenesis, but we can make comparisons between the two. And then we use simulated data, which is a very simple method, but it's informative with respect to some of these distribution-based approaches. The introduction goes over why this might be important and how it ties into the literature.
A: Then I talk a little bit about some of the theoretical underpinnings of it. I have some methods I have to flesh out, and then we have some figures. We have some data from C. elegans, where you have these birth times distributed in bins: for every five minutes of development, we have a number of these division events, and that represents a bin in this graph. And so you can see that these are things that happen in the embryo, and these are things that happen post-embryonically.
A
And
then
you
we
have
this
interval
analysis
where
we
take
the
intervals
between
these
and
plot
them
out
and
try
to
figure
out
what
the
distribution
of
those
intervals
look
like.
And
then
we
go
back
to
zebrafish
and
we
look
at
similar
things
going
on
in
zebrafish
at
this
time
period,
which
are
two
developmental
periods.
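A minimal sketch of that binning-plus-interval analysis on a list of division times; the times below are invented, not the paper's data:

```python
import numpy as np

division_times = np.sort([3.0, 4.1, 4.3, 9.8, 10.2, 10.5, 22.0, 22.4])  # minutes

# Bin division events into 5-minute windows of developmental time.
edges = np.arange(0, division_times.max() + 5, 5)
counts, _ = np.histogram(division_times, bins=edges)
print("events per 5-minute bin:", counts)

# Interval analysis: the waiting times between consecutive divisions.
intervals = np.diff(division_times)
print("intervals:", intervals)
print("mean=%.2f  sd=%.2f" % (intervals.mean(), intervals.std()))
```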
A: On closer inspection, the first developmental period looks like there's a sort of regular pulsing here, but it breaks down later on in development; so the zygote stage is more regular than the cleavage stage.
A
So
we
kind
of
focus
in
on
that.
Then
we
do
another
analysis
of
the
intervals
and
then
kind
of
get
into
the
synthetic
embryo
where
we're
generating
a
bunch
of
division
times,
based
on
different
distributions
that
we
assume
that
that
these
divisions
conform
to
this
is
a
uniform
distribution
of
cell
divisions
versus
some
of
the
other
types
of
cell
divisions
using
other
types
of
distributions.
A
Is
basically
and
then
I'm
gonna,
this
is
basically
the
result.
I'm
gonna
write
up
about
that.
Then
the
discussion,
which
is
just
kind
of
interpreting
the
data
on
the
references,
so
it's
pretty
short
right
now.
It
needs
more
work.
I'm
going
to
be
working
on
it.
If
you're
interested
in
collaborating
on
this,
I
can
send
you
a
link
to
this
doc
and
you
can
read
it
over
give
your
comments.
I'll,
probably
send
it
out
for
comments
and
slack
before
I
submit
it.
A
So
that's
and
then-
and
you
know,
if
you
have
an
opportunity
for
authorship,
if
you
can
make
comments
or
maybe
make
some
additions
or
changes,
then
that's
something
that
you
can
also.
You
know
that's
an
opportunity
for
authorship,
I'm
pretty
open
with
respect
to
authorship
on
these
sorts
of
things.
So
that's
that's
that
paper.
Hopefully
we
can
get
that
done
by
the
end
of
the
year
and
then
we
have
some
papers.
A
So
usually
in
these
group
meetings
we
go
over
a
couple
of
papers
that
were
interesting
in
the
last
week
or
so,
and
we
kind
of
review
them
very
quickly
just
to
give
people
a
taste
of.
What's
out
in
the
literature
and
if
you're
interested
in
following
up
on
this,
I
can
give
you
a
link
to
this
folder.
A
Okay,
yeah
we
had
some
krishna
wants
to
present
something
and
jesse
says
if
you
need
any
review
or
copy
editing.
Let
me
know
yeah
we'll
be
talking
about
that.
So.
A: Okay, yeah, yeah, yeah. Let me go through the papers real quick, and then we'll do that. So, one of the things I found: last night I was reading this blog post by someone called James Somers, and he says "I should have loved biology." It's an interesting take: he's a computer scientist, but he's not really that well versed in biology, and he's trying to understand...
A
He
wants
to
really
understand
more
about
biology.
So
there's
a
lot
of
really
exciting
stuff
going
on
so
he's
talking
here
about
the
state
of
sort
of
getting
sort
of
into
biology
from
a
very
you
know,
baseline
position.
I
guess
you
know
we
probably
took
high
school
biology,
but
there's
just
when
you
try
to
read
an
academic
paper,
it's
almost
impossible
to
really
kind
of
get
through
a
lot
of
the
jargon.
A
So
he
says
that
he's
starting
a
magazine
assignment
to
answer
some
questions
about
sars,
co,
v2,
which
is,
of
course
coronavirus
and
the
immune
system-
and
he
reads
an
academic
paper
where
they
have
a
paragraph
like
this.
So
you
know
they
have
a
lot
of
different
abbreviations
in
here.
They
have
some
a
lot
of
jargon,
terms,
total
reads:
mapping
more
than
300
times
coverage
across
the
30
kb
genome.
What
does
that
mean
to
someone
who
doesn't
really
know
a
lot
about
biology
or
genomics,
and
so
just
kind
of
you
know.
A: And so he says: but biology, like computing, has a bottom, and the bottom is not abstract; it's physical, it's shapes bumping into each other. In fact, the great revelation of 20th-century molecular biology was the coupling of structure and function.
A
So
this
is
something
that
then
he
talks
about
how
you
know
you
have
a
lot
of
good
resources
out
there.
People
are
doing
things
like
animations
cartoons,
showing
a
lot
of
the
processes,
and
you
know
going
beyond
these
abstract
acronyms
to
these
really,
you
know
things
that
really
help
you
understand
what's
going
on
in
the
biology,
and
so
he
kind
of
advocates
for
this
idea
of
you
know
how
do
we
make
this
easier
for
people
to
understand?
Coming
in?
A
I
thought
this
article
was
very
relevant
to
this
group,
because
that's
what
we
kind
of
try
to
do
in
this
group.
One
of
the
missions
of
the
group
is
to
make
this
stuff
easier
to
understand,
we're
creating
educational
tools
and
analytical
tools,
and
you
know
sometimes
it's
it's
easy
to
to
lose
sight
of
the
fact
that
there's
a
lot
there's
a
lot
of
it's
a
lot
of
hard
stuff,
and
this,
I
guess,
holds
for
the
computational
stuff
as
well.
You
don't
really
understand
not
just
machine
learning.
A: There's this book, A Computer Scientist's Guide to Cell Biology, by William Cohen, and he mentions this book. I think it may be available online; I'm not sure. You might want to give it a read-through if you're a computer scientist, or even if you're not and you want to know cell biology; this might be a good way to get a better appreciation for it. And he talks a lot about people drawing on these technologies to get a bit better foundation in their biological thinking. So this is a pretty good article if you're interested in education, and maybe in presenting your work more clearly. So there's that. And then Dick sent me, he's not here this week, but he sent me this paper, "Arrested coalescence in multicellular aggregates." So multicellular aggregates are known to exhibit liquid-like properties.
A: The fusion process of two cell aggregates is commonly studied as the coalescence of two viscous drops. However, tissues are complex materials which usually exhibit viscoelastic behavior. It is known that elastic effects can prevent the complete fusion of two drops, a phenomenon known as arrested coalescence. Here...
A: So this is where you have these sorts of states of matter; we think of them in terms of physics, but now we're thinking about this existing with biological cells as well, revealing that arrested coalescence can be found in the vicinity of an unjamming transition.
A
They
can't
move
all
of
a
sudden,
they're
restricted
in
their
movement,
and
this
happens
very
quickly.
You
know
at
a
certain
density,
and
so
what
they're
talking
about
is
this
sort
of
transition
between
jamming
and
jamming?
And
so
this
is
a
really
interesting
sort
of
thing.
So
you
can
do
this.
You
can
analyze
this
with
an
agent-based
model
where
you
have
a
bunch
of
agents
that
are
these
cells,
and
I
think
we've
shown
these
agent-based
models
in
the
group.
A: So this sounds like something that Susan would be very interested in; this is really about the shaping process during morphogenesis, how organs are built. And so this is an example here; I don't know if I can zoom in on this, but you can see that they've got these cells, these are stem cells, and then they form these cell aggregates, and then the aggregates fuse together; this is what they call arrested coalescence.
A
And
so
they
can
measure
this.
They
can
also
simulate
it
with
a
agent-based
model.
They
look
at
the
different
physical
properties
of
these
aggregates
and
when
they
come
together,
so
they're
coalescing
together,
they're
sort
of
almost
like
the
two
clusters
are
overlapping,
whereas
when
they're,
not
overlapping,
there's
no
coalescence.
A: You know, they think that the material properties of the cell aggregates are much more complicated than we can get from a single physical model, and so we turn to these types of agent-based models; they're actually using the GPU to model this. And I don't know if they show graphs of this... well, here's one here. I guess this is the agent-based simulation of these aggregates, so they have these red and blue cells.
A
You
know
each
one
of
these
cells
is
an
agent
and
they're
watching
it
they're
coalescent
the
two
clusters
coalesce
into
a
single
cluster,
and
then
they
have
this
regime
here,
where
they
have
protrusion
strength
and
protrusion
ratio,
and
they
have
this
sort
of
map
of
the
different
phase
regimes.
So
they
have
no
coalescence
arrested,
coalescence,
complete
coalescence,
and
so
you
can
build
a
graph
like
that.
Where
you
have
this
transition-
and
this
is
of
course,
the
transition
here-
this
dotted
line.
A: This is a book that's old, if you're interested in old books, or, you know, just sort of the history of developmental biology: there's this book, Modern Theories of Development, from 1938, by von Bertalanffy and Woodger, who were two classic authors in this area.
A
So
this
is,
I
mean
some
of
the
reasons
might
be
dated,
but
it's
they
kind
of
talk
a
lot
about
sort
of
the
current
theories
of
development,
and
this
is
sort
of
a
record
of
this
book.
So
I
just
found
this
on.
You
know
in
my
readings
I
thought
that
might
be
interesting
to
bring
up
some
of
the
older
classic
books.
This
is
one
of
the
books.
Again,
this
is
almost
100
years
old.
So
it's
kind
of
you
know.
A
Modern
theories
is
kind
of
a
bad
name
for
it,
maybe
at
this
point,
but
so
they
basically
talk
about.
It
is
pointed
out
that
a
crisis
has
been
reached
in
biology
in
1938
due
to
the
rapid
accumulation
of
facts
without
clear
theoretical
laws,
and
so
this
is
the
state
that
they
found
themselves
in
1930,
and
maybe
today,
even
the
controversy
between
mechanism
and
vitalism
is
revived
and
is
shown
where
each
of
these
theories
of
life
breaks
down.
A
The
author
also
considers
the
physiochemical
explanation
of
single
phenomena
in
the
organism,
and
that
does
not
suffice
for
the
foundation
of
a
theoretical
biology,
as
it
fails
to
establish
laws
to
explain
the
arrangement
of
organizational
material
processes,
and
so
there's
this
idea.
If
you
want
to,
if
you're
interested
in
the
history
of
theories,
I
know
we
have
this
theory
building
initiative
that
I've
been
talking
about.
It's
really
just
a
way
to
present
this
to
people,
and
I
think
this
will
be
part
of
that
discussion.
Where
you
know
people
are
interested
in.
A: In the past, he's been, I think, spearheading this 4D cell nuclear tracking online course. And so what this is, is a course on cell tracking, where they track the nucleus. In fact, a lot of the microscopy data that we work with in this group are based on these sorts of cell tracking methods, where they usually stain the cell with some marker, and then they can track the cell in microscopy images, so they can get it into a sort of common framework.
A
So
you
can
look
at
how
cells
move
but
you're
tracking
the
nucleus,
so
it's
not
the
entire
cell
necessarily
a
lot
of
times.
It's
just
the
position
of
the
nucleus,
the
time
that
it
appears
the
time
that
it
divides,
and
so
this
is
a
way.
This
is
a
course
it
may
be
beyond
if
you're
interested
just
in
the
output
data,
it
may
be
beyond
what
you're
interested
in,
but
it's
basically
goes
through
a
lot
of
the
stuff
that
you.
A: And then this course introduces the use of this public database, SSBD, which is something that we use. It's a very good resource for input data for models; there's a lot of data on different organisms, for development or for other types of cell biology. So, if you're interested in that, maybe even if you're into machine learning, it'd be good to go through these materials to get a sort of an idea of how the data are collected.
A: So it kind of goes back to our input data theme. So now, Krishna, if you want to present; did you want to present?
F: Something you... [inaudible] ...in the paper.
H: So, is my screen visible to you all, or not?
G: Yeah, so I've been working on this, you could say, problem in bioinformatics. So here, what I am doing: I am going to, you know, do PCA of the genomic data to estimate ancestry.
G: So, first of all, let me tell you what the 1000 Genomes Project is. It was a project that came up in 2007, where, you know, at least a thousand volunteers, each of a single ethnicity, were considered, and their genomic data was taken. And here is an example of PCA, that's principal component analysis.
G: So this is an example of genomic data. It's in a format that is called variant call format (VCF).
G: ...a good picture, okay. So what happened is: first of all, the data that I'm working with is merged with the 1000 Genomes data; and then, after that, I used a tool named PLINK, which gives me the eigenvalues; and after that, these eigenvalues are plotted in the R language to get the, you know, visualization.
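The speaker's pipeline uses PLINK and R; as a rough sketch of the same idea in Python (PCA on a genotype matrix, with a tiny made-up matrix standing in for real VCF-derived genotypes):

```python
import numpy as np
from sklearn.decomposition import PCA

# Rows = individuals, columns = variants, entries = alternate-allele counts (0/1/2).
# A tiny invented genotype matrix standing in for merged 1000 Genomes + query data.
genotypes = np.array([
    [0, 1, 2, 0, 1],
    [0, 1, 2, 0, 0],
    [2, 0, 0, 2, 2],
    [2, 1, 0, 2, 2],
    [1, 1, 1, 1, 1],   # the individual whose ancestry we want to place
])

coords = PCA(n_components=2).fit_transform(genotypes)
print(coords)   # plotting these puts the query individual near its ancestral cluster
```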
G: So, for example, if here I'm having Asian people, and here I'm having African people, and my point is somewhere here, then I get the chance that the person whose ancestry I'm going to check belongs somewhere close to the African people. And how can it be helpful? It can be used in selective medication, because it has been found that different races respond to different medicines in different ways.
G: There are some sorts of antibiotics that are said to be not as efficient in some races as they are in others. And one more thing is early disease forecasting: as you know, Indian people are, you know, more prone to diabetes, and, like, Caucasian people are, you know, more prone to pollen allergy...
G: ...I guess. So what we can do is, if we are able to, you know, predict the ancestry of a person, we can check which type of disease he is more prone to.
G: So these are just the things that I require: the variant call format of the 1000 Genomes data, and the variant call format of my own data. Then I...
G: ...[got the data from the National Center for] Biotechnology Information. So here is the data flow. First of all, the data is collected and converted into a format that is, you know, accessible by the tool, PLINK; then we exclude the variants that are not necessary; and after pruning each of the chromosomes, we get the eigenvalues. PLINK only gives us the eigenvalues; it doesn't visualize the chart, so that has to be done in the R language. So, the things required for that: we need to have bash scripting for creating the scripts.
G: We need PLINK; we need VCFtools, which is a Linux tool that is used to manipulate and convert the VCF files; and we might need a little Perl scripting, so that we can call the libraries which will help us in the pruning of data. And we have libraries like samtools, bcftools, and then htslib.
G: So I guess that's all, and that's, you know, an example of how the VCF data looks. It can be opened in Microsoft Excel so that you can, you know, visualize it and see how it really is.
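For readers who haven't seen the format, a minimal sketch of pulling the standard columns out of one VCF record (the record shown is invented):

```python
# VCF is plain text: meta lines start with "##", the header line with "#CHROM",
# and each record is tab-separated: CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO, ...
record = "1\t10177\trs367896724\tA\tAC\t100\tPASS\t."

chrom, pos, rsid, ref, alt = record.split("\t")[:5]
print(f"variant {rsid}: chr{chrom}:{pos} {ref}->{alt}")
```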
G: I guess that's all; any questions? This is an example of PCA; it is PCA of the 1000 Genomes data.
I: Yeah, I...
G: I was, you know, badly stuck on it.
A: Something I wanted to do, like... is that okay? Oh yeah, that's good, that's good! Yeah, yeah, look forward to it! Okay, well, thank you for attending, and if you have any questions, contact us on Slack or on email or whatever. And yeah, see you next week. Thanks, everyone.
G: Alright, bye.