GitLab / Sharding Working Group

Add meeting Rate page Subscribe

GitLab / Sharding Working Group

These are all the meetings we have in "Sharding Working Group" (part of the organization "GitLab"). Click into individual meeting pages to watch the recording and search or read the transcript.

15 Jul 2021

Kickoff for the 14.2 release for the Sharding team.

Planning issue: https://gitlab.com/gitlab-org/sharding-group/group-tasks/-/issues/1
  • 1 participant
  • 7 minutes
sharding
gitlab
scalability
servers
important
process
cluster
schemas
dashboards
complexity
youtube image

22 Jun 2020

This video culminates in the conclusion of the Sharding Working group. The timelines below are approximate:

0:00 Intro, read agenda
4:05 Gerir Feedback on sharding approach
5:30 Clarification of Sharding Terminology
7:10 Discussed time and ability to explore other scalability options
8:02 Questioned Audit Events priority
8:36 Requested Lexicon
9:12 Doing Partitioning not doing Sharding - other low hanging fruits
10:15 Garbage in garbage out
13:40 Start talking about alternate solutions
16:10 Talking about Big instances of GitLab (EU, Federal, etc)
18:04 Propose closing Sharding Group and focus on Scalability
20:00 Decision to move forward with Scaling Working Group
21:43 Charter for Scaling Working Group
  • 7 participants
  • 24 minutes
sharding
partitioning
roadmap
migration
discussion
namespace
structuring
renaming
hosting
scaling
youtube image

8 Jun 2020

No description provided.
  • 7 participants
  • 23 minutes
sharding
migrate
partitioning
scalability
auditing
prioritizations
capacity
managed
incremental
workflows
youtube image

26 May 2020

No description provided.
  • 7 participants
  • 30 minutes
capacity
analyzing
planning
sharding
database
conclusions
details
production
performance
summarize
youtube image

18 May 2020

No description provided.
  • 8 participants
  • 24 minutes
capacity
benchmarking
planning
gpt
phosphorus
research
needs
charting
optimizing
dbt
youtube image

4 May 2020

No description provided.
  • 5 participants
  • 28 minutes
partition
migrations
sharding
charting
partitioning
merge
diverging
capabilities
partitions
plan
youtube image

29 Apr 2020

  • 1 participant
  • 11 minutes
replication
schema
partitioning
charts
database
issue
publish
replicating
logical
demos
youtube image

29 Apr 2020

Quick demo of using postgres declarative partitioning in combination with foreign data wrappers to push partitions to a foreign server.

We're tackling one table of the GitLab schema here, creating 8 partitions and distributing them on 2 shards. Shard in this example is a local database, which can also live on a remote cluster.

Part 1: https://www.youtube.com/watch?v=MiZFtM84x44
Part 2: https://www.youtube.com/watch?v=nt4Khi9Gr3o&feature=youtu.be
Part 3: https://youtu.be/ztQtNmSYmEo

https://gitlab.com/gitlab-org/database-team/team-tasks/-/issues/53
  • 1 participant
  • 8 minutes
partitioning
sharding
github
distributed
postgres
servers
project
migrations
partition
petitions
youtube image

29 Apr 2020

Follow-up to https://www.youtube.com/watch?v=MiZFtM84x44 to checkout how we can do schema migrations.

Very basic examples: Adding and dropping a column.


Part 1: https://www.youtube.com/watch?v=MiZFtM84x44
Part 2: https://www.youtube.com/watch?v=nt4Khi9Gr3o&feature=youtu.be
Part 3: https://youtu.be/ztQtNmSYmEo
  • 1 participant
  • 6 minutes
partition
schema
migrations
shards
setup
column
partitions
table
alter
charts
youtube image

27 Apr 2020

No description provided.
  • 7 participants
  • 23 minutes
maintenance
scheduled
iacv
services
migrations
andreas
managed
today
features
concern
youtube image

24 Apr 2020

No description provided.
  • 7 participants
  • 27 minutes
services
brainstorming
decisions
extraction
discussion
workloads
efficiently
process
proposal
concerns
youtube image

20 Apr 2020

No description provided.
  • 8 participants
  • 15 minutes
situs
testing
significant
charting
maintenance
performance
estimates
concerns
notes
pg
youtube image

16 Apr 2020

We discuss different scaling approaches for GitLab, namely
* database sharding
* application sharding
* service extraction.

Internal doc with notes: https://docs.google.com/document/d/1u9e736mu4SlVtE2gf4HWsqsoV__Dk68qtsrvyfXvky8/edit#

Rough timeline
0:00 Overview
3:30 Service Extraction - Concerns
5:30 Databases on the same cluster
5:58 Praefect Example
7:13 Security concerns when databases share the same schema
8:04 Container registry example
9:10 Data management concerns with single database
11:20 Replaceability of systems - modularity
16:08 What is application sharding Pros/Cons
21:13 Considerations of sharding by namespaces
28:06 What's the strategy
29:56 Management of self hosted databases
31:20 Replication
31:53 Praefect Replication concerns
35:00 Application sharding probably inevitable
36:44 Service extraction details
41:50 Distribution concerns with Service Extraction
43:30 Summary - Service Extraction next steps
48:00 Database guidance and struggles with current structure
  • 8 participants
  • 1:00 hours
databases
discussion
sharding
manages
registry
configuration
enterprise
charting
disagreement
separate
youtube image

8 Apr 2020

No description provided.
  • 6 participants
  • 20 minutes
costs
cpus
servers
utilization
workloads
1100
scalability
estimate
cores
production
youtube image

6 Apr 2020

No description provided.
  • 8 participants
  • 20 minutes
situs
postgres
provisioning
concerns
pg
distributed
dependency
managed
clarification
postscript
youtube image

1 Apr 2020

No description provided.
  • 5 participants
  • 26 minutes
enterprise
licensing
transactional
offering
enterprises
decisions
customers
concerns
policy
microservices
youtube image

30 Mar 2020

No description provided.
  • 5 participants
  • 10 minutes
situs
staging
implementation
ongoing
coordinating
updates
siteís
exploring
cluster
manage
youtube image

23 Mar 2020

No description provided.
  • 8 participants
  • 25 minutes
migrated
planning
shortly
process
considerations
leaving
maintenance
pg
latency
scaling
youtube image

2 Mar 2020

No description provided.
  • 7 participants
  • 28 minutes
staging
considerations
charting
plan
siteís
scaling
prioritization
timelines
functionality
situs
youtube image

24 Feb 2020

No description provided.
  • 12 participants
  • 27 minutes
sharding
discussion
pg
provides
planning
updated
ongoing
significantly
general
reviewed
youtube image