Update Elasticsearch integration docs

a23edfed · Robert Speicher · 3dff8b82 · a23edfed
Commit a23edfed authored Jan 29, 2016 by Robert Speicher

Showing with 28 additions and 18 deletions

doc/integration/elasticsearch.md +28 -18

--- a/doc/integration/elasticsearch.md
+++ b/doc/integration/elasticsearch.md
@@ -114,15 +114,19 @@ sudo gitlab-rake gitlab:elastic:index_repositories
 bundle exec rake gitlab:elastic:index_repositories RAILS_ENV=production
 ```

-If you want to run several tasks in parallel (probably in separate terminal windows) you can pass parameters `ID_FROM` and `ID_TO` like this:
+If you want to run several tasks in parallel (probably in separate terminal
+windows) you can provide the `ID_FROM` and `ID_TO` parameters:

 ```
 ID_FROM=1001 ID_TO=2000 sudo gitlab-rake gitlab:elastic:index_repositories

 ```

-Both parameters are optional. Keep in mind also that this task will skip repositories (and certain commits) you have already indexed. It stores the last commit SHA of every indexed repository in the database.
-As an example, if you have 3000 of repositories and you want to run tree separate indexer your commands would be like:
+Both parameters are optional. Keep in mind that this task will skip repositories
+(and certain commits) that have already been indexed. It stores the last commit
+SHA of every indexed repository in the database. As an example, if you have
+3,000 repositories and you want to run three separate indexing tasks, you might
+run:

 ```
 ID_TO=1000 sudo gitlab-rake gitlab:elastic:index_repositories
@@ -139,7 +143,9 @@ sudo gitlab-rake gitlab:elastic:index_wikis
 # Installations from source
 bundle exec rake gitlab:elastic:index_wikis RAILS_ENV=production
 ```
-Wiki indexer also supports `ID_FROM` and `ID_TO` parameters if you want to limit a project set.
+
+The wiki indexer also supports the `ID_FROM` and `ID_TO` parameters if you want
+to limit a project set.

 To index all database entities:

@@ -213,27 +219,31 @@ time drop.
    curl -XPOST 'http://localhost:9200/_forcemerge?max_num_segments=5'
    ```

-To minimize a downtime of search feature we recommend next sequence of actions:
-
-1. Configure elastic search in gitlab.yml or gitlab.rb for omnibus installations but
-   do not enable it, just set a host and port.
+To minimize downtime of the search feature we recommend the following:

-1. Create empty indexes. Run
-   ```
-   # Omnibus installations
-   sudo gitlab-rake gitlab:elastic:create_empty_indexes
+1. Configure Elasticsearch in `gitlab.yml`, or `gitlab.rb` for Omnibus
+   installations, but do not enable it, just set a host and port.

-   # Installations from source
-   bundle exec rake gitlab:elastic:create_empty_indexes
-   ```
+1. Create empty indexes:

-1. Index all repositories by `gitlab:elastic:index_repositories` rake task(see above). Probably you will do it in parallel
+    ```
+    # Omnibus installations
+    sudo gitlab-rake gitlab:elastic:create_empty_indexes

-1. Enable elasticsearch and restart GitLab application.
+    # Installations from source
+    bundle exec rake gitlab:elastic:create_empty_indexes
+    ```

-1. Run indexers for database, wikis and repositories. By running reposotory indexer twise you will be sure that eveything is indexed because some commits could be pushed while you performed initial indexing. And don't worry, repository indexer will skip repositories and commits that are already indexed, in other words, it will be much faster this time.
+1. Index all repositories using the `gitlab:elastic:index_repositories` Rake
+   task (see above). You'll probably want to do this in parallel.

+1. Enable Elasticsearch and restart GitLab.

+1. Run indexers for database, wikis, and repositories. By running the repository
+   indexer twice you will be sure that everything is indexed because some
+   commits could be pushed while you performed initial indexing. The repository
+   indexer will skip repositories and commits that are already indexed, so it
+   will be much shorter than the first run.

 [ee-109]: https://gitlab.com/gitlab-org/gitlab-ee/merge_requests/109 "Elasticsearch Merge Request"
 [elasticsearch]: https://www.elastic.co/products/elasticsearch "Elasticsearch website"
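
The "3,000 repositories, three indexing tasks" example in this change splits the project ID space into equal ranges by hand. A minimal sketch of that arithmetic follows, assuming project IDs run from 1 to a known maximum and that `ID_FROM` and `ID_TO` are inclusive bounds (the ranges shown in the docs suggest this, but it is not stated there). The function name `print_index_commands` and its two arguments are hypothetical. It only prints the commands, so each line can be reviewed and then run in its own terminal.

```shell
#!/bin/sh
# Print one indexing command per ID range. Nothing is executed here.
# print_index_commands is a hypothetical helper, not part of GitLab.
print_index_commands() {
  total=$1      # highest project ID to cover (assumption: IDs run 1..total)
  workers=$2    # number of parallel indexing tasks
  # Ceiling division, so the last range is the short one when total
  # does not divide evenly.
  chunk=$(( (total + workers - 1) / workers ))
  from=1
  while [ "$from" -le "$total" ]; do
    to=$(( from + chunk - 1 ))
    if [ "$to" -gt "$total" ]; then
      to=$total
    fi
    echo "ID_FROM=$from ID_TO=$to sudo gitlab-rake gitlab:elastic:index_repositories"
    from=$(( to + 1 ))
  done
}

print_index_commands 3000 3
```

Because both parameters are optional, the first range could equally omit `ID_FROM`, as the documented example does. The sketch prints both bounds on every line only to keep the output uniform.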