Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
G
gitlab-ce
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
1
Merge Requests
1
Analytics
Analytics
Repository
Value Stream
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Commits
Issue Boards
Open sidebar
nexedi
gitlab-ce
Commits
6f763748
Commit
6f763748
authored
Dec 20, 2021
by
Grzegorz Bizon
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Add a section about principles to CI/CD time decay blueprint
Also add a few more minor changes
parent
4fd5ab16
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
58 additions
and
39 deletions
+58
-39
doc/architecture/blueprints/ci_data_decay/index.md
doc/architecture/blueprints/ci_data_decay/index.md
+58
-39
No files found.
doc/architecture/blueprints/ci_data_decay/index.md
View file @
6f763748
...
@@ -50,19 +50,23 @@ cost effective.
...
@@ -50,19 +50,23 @@ cost effective.
It is already possible to prevent processing builds
[
that have been
It is already possible to prevent processing builds
[
that have been
archived
](
../../../user/admin_area/settings/continuous_integration.md#archive-jobs
)
.
archived
](
../../../user/admin_area/settings/continuous_integration.md#archive-jobs
)
.
When a build gets archived it will not be possible to retry it, but we
do not
When a build gets archived it will not be possible to retry it, but we
still do
move data out of the database, it still consumes resources that are scarce in
keep all the processing metadata in the database, and it consumes resources
the primary database.
th
at are scarce in th
e primary database.
In order to improve performance and make it easier to scale CI/CD data storage
In order to improve performance and make it easier to scale CI/CD data storage
we might want to follow these three tracks described below.
we might want to follow these three tracks described below.
![
pipeline data time decay
](
pipeline_data_time_decay.png
)
![
pipeline data time decay
](
pipeline_data_time_decay.png
)
<!-- markdownlint-disable MD029 -->
1.
Partition builds queuing tables
1.
Partition builds queuing tables
2.
Archive CI/CD data into partitioned database schema
2.
Archive CI/CD data into partitioned database schema
3.
Migrate archived builds metadata out of primary database
3.
Migrate archived builds metadata out of primary database
<!-- markdownlint-enable MD029 -->
### Migrate archived builds metadata out of primary database
### Migrate archived builds metadata out of primary database
Once a build (or a pipeline) gets archived, it is no longer possible to resume
Once a build (or a pipeline) gets archived, it is no longer possible to resume
...
@@ -81,12 +85,12 @@ be able to use de-duplication of metadata entries and other normalization
...
@@ -81,12 +85,12 @@ be able to use de-duplication of metadata entries and other normalization
strategies to consume less storage while retaining ability to query this
strategies to consume less storage while retaining ability to query this
dataset. Technical evaluation will be required to find the best solution here.
dataset. Technical evaluation will be required to find the best solution here.
Epic:
[
Migrate
build metadata of archived pipelines
](
https://gitlab.com/groups/gitlab-org/-/epics/7216
)
.
Epic:
[
Migrate
archived builds metadata out of primary database
](
https://gitlab.com/groups/gitlab-org/-/epics/7216
)
.
### Archive CI/CD data into partitioned database schema
### Archive CI/CD data into partitioned database schema
After we move CI/CD metadata to a different store, the problem of having
After we move CI/CD metadata to a different store, the problem of having
billions of rows describing pipelines, build and artifacts, remains. We still
billions of rows describing pipelines, build
s
and artifacts, remains. We still
need to keep reference to the metadata we store in object storage and we still
need to keep reference to the metadata we store in object storage and we still
do need to be able to retrieve this information reliably in bulk (or search
do need to be able to retrieve this information reliably in bulk (or search
through it).
through it).
...
@@ -99,9 +103,8 @@ reduce the impact on the database (indices size, auto-vacuum time and
...
@@ -99,9 +103,8 @@ reduce the impact on the database (indices size, auto-vacuum time and
frequency).
frequency).
Our intent here is not to move this data out of our primary database elsewhere.
Our intent here is not to move this data out of our primary database elsewhere.
What we want to achieve here is to divide very large database tables, that
What want to divide very large database tables, that store CI/CD data, into
store CI/CD data, into multiple smaller ones, using PostgreSQL partitioning
multiple smaller ones, using PostgreSQL partitioning features.
capabilities.
There are a few approaches we can take to partition CI/CD data. A promising one
There are a few approaches we can take to partition CI/CD data. A promising one
is using list-based partitioning where a partition number is assigned a
is using list-based partitioning where a partition number is assigned a
...
@@ -116,7 +119,7 @@ partitioning on the application level.
...
@@ -116,7 +119,7 @@ partitioning on the application level.
Partitioning rarely accessed data should also follow the policy defined for
Partitioning rarely accessed data should also follow the policy defined for
builds archival, to make it consistent and reliable.
builds archival, to make it consistent and reliable.
Epic:
[
Partition archived CI/CD dat
a
](
https://gitlab.com/groups/gitlab-org/-/epics/5417
)
.
Epic:
[
Archive CI/CD data into partitioned database schem
a
](
https://gitlab.com/groups/gitlab-org/-/epics/5417
)
.
### Partition builds queuing tables
### Partition builds queuing tables
...
@@ -124,10 +127,11 @@ While working on the [CI/CD Scale](../ci_scale/index.md) blueprint, we have
...
@@ -124,10 +127,11 @@ While working on the [CI/CD Scale](../ci_scale/index.md) blueprint, we have
introduced a
[
new architecture for queuing CI/CD builds
](
https://gitlab.com/groups/gitlab-org/-/epics/5909#note_680407908
)
introduced a
[
new architecture for queuing CI/CD builds
](
https://gitlab.com/groups/gitlab-org/-/epics/5909#note_680407908
)
for execution.
for execution.
This allowed us to significant improve performance, but we still do consider
This allowed us to significantly improve performance. We still consider the new
the new solution as an intermediate mechanism, needed before we start working
solution as an intermediate mechanism, needed before we start working on the
on the next iteration, that should improve the architecture of builds queuing
next iteration. The following iteration that should improve the architecture of
even more (it might require moving off the PostgreSQL fully or partially).
builds queuing even more (it might require moving off the PostgreSQL fully or
partially).
In the meantime we want to ship another iteration, an intermediate step towards
In the meantime we want to ship another iteration, an intermediate step towards
more flexible and reliable solution. We want to partition the new queuing
more flexible and reliable solution. We want to partition the new queuing
...
@@ -139,49 +143,64 @@ for builds archival. Instead we should leverage a long-standing policy saying
...
@@ -139,49 +143,64 @@ for builds archival. Instead we should leverage a long-standing policy saying
that builds created more 24 hours ago need to be removed from the queue. This
that builds created more 24 hours ago need to be removed from the queue. This
business rule is present in the product since the inception of GitLab CI.
business rule is present in the product since the inception of GitLab CI.
Epic:
[
P
repare queuing tables for list-style partitioning
](
https://gitlab.com/gitlab-org/gitlab/-/issues/347027
)
.
Epic:
[
P
artition builds queuing tables
](
https://gitlab.com/gitlab-org/gitlab/-/issues/347027
)
.
##
Caveat
s
##
Principle
s
All the three tracks we will use to implement CI/CD time decay pattern are
All the three tracks we will use to implement CI/CD time decay pattern are
associated with some challenges. Most important ones are documented below.
associated with some challenges. As we progress with the implementation we will
need to solve many problems and devise many implementation details to make this
successful.
Below, we documented a few foundational principles to make it easier for
everyone to understand the vision described in this architectural blueprint.
### Removing data
### Removing
pipeline
data
While it might be tempting to simply remove old or archived data from our
While it might be tempting to simply remove old or archived data from our
database
,
this should be avoided. We should not permanently remove user data
database
s
this should be avoided. We should not permanently remove user data
unless
a
consent is given to do so. We can, however, move data to a different
unless consent is given to do so. We can, however, move data to a different
data store, like object storage.
data store, like object storage.
Archived data can still be needed sometimes (for example for compliance
Archived data can still be needed sometimes (for example for compliance
or
reasons). We want to be able to retrieve this data if needed, as long as
auditing reasons). We want to be able to retrieve this data if needed, as long
permanent removal has not been requested or approved by a user.
as
permanent removal has not been requested or approved by a user.
### Accessing
data
### Accessing
pipeline data in the UI
Implementing CI/CD data time-decay through partitioning might be challenging
Implementing CI/CD data time-decay through partitioning might be challenging
when we still want to make it possible for users to access data stored across
when we still want to make it possible for users to access data stored in many
many partitions.
partitions.
We want to retain simplicity of accessing pipeline data in the UI. It will
require some backstage changes in how we reference pipeline data from other
resources, but we don't want to make it more difficult for users to find their
pipelines in the UI.
We may need to add "Archived" tab on the pipelines / builds list pages, but we
should be able to avoid additional steps / clicks when someone wants to view
pipeline status or builds associated with a merge request or a deployment.
We also may need to disable search in the "Archived" tab on pipelines / builds
list pages.
In order to do that we will need to make sure that when archived data needs be
### Accessing pipeline data through the API
accessed, users provide a time range in which the data has been created. In
order to make it efficient it might be necessary to restrict access to querying
data residing in more than two partitions at once. We can do that by supporting
time ranges spanning the duration that equals to the builds archival policy.
#### Merge request pipelines
We accept the possible necessity of building a separate API endpoint /
endpoints needed to access pipeline data through the API.
Once we partition CI/CD data, especially CI builds, we need to find an
In the new API users might need to provide a time range in which the data has
efficient mechanism to present pipeline statuses in merge requests.
been created to search through their pipelines / builds. In order to make it
efficient it might be necessary to restrict access to querying data residing in
more than two partitions at once. We can do that by supporting time ranges
spanning the duration that equals to the builds archival policy.
How to exactly do that is an implementation detail that we will need to figure
It is possible to still allow users to use the old API to access archived
out as the work progresses. We do have many tools to achieve that - data
pipelines data, although a user provided partition identifier may be required.
denormalization, routing reads to proper partitions based on data stored with a
merge request.
## Iterations
## Iterations
All three tacks can be worked on in parallel:
All three t
r
acks can be worked on in parallel:
1.
[
Migrate archived build metadata to object storage
](
https://gitlab.com/groups/gitlab-org/-/epics/7216
)
.
1.
[
Migrate archived build metadata to object storage
](
https://gitlab.com/groups/gitlab-org/-/epics/7216
)
.
1.
[
Partition CI/CD data that have been archived
](
https://gitlab.com/groups/gitlab-org/-/epics/5417
)
.
1.
[
Partition CI/CD data that have been archived
](
https://gitlab.com/groups/gitlab-org/-/epics/5417
)
.
...
@@ -189,7 +208,7 @@ All three tacks can be worked on in parallel:
...
@@ -189,7 +208,7 @@ All three tacks can be worked on in parallel:
## Status
## Status
Request For Comments
.
In approval
.
## Who
## Who
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment