Hey all,
Our lava instance has now run well over 120,000 jobs, which is great.
Unfortunately this means we've got quite a lot of historical data: the postgresql database is around 45 gigs, and there's well over 600 gigs of job output data in the filesystem. We'd love to trim that down to more sensible sizes by pruning the older job information. Are there any guidelines on how to do that (and/or tools available for it)?
On Mon, 04 Jan 2016 15:37:38 +0100 Sjoerd Simons sjoerd.simons@collabora.co.uk wrote:
> Hey all,
> Our lava instance has now run well over 120,000 jobs, which is great.
> Unfortunately this means we've got quite a lot of historical data: the postgresql database is around 45 gigs, and there's well over 600 gigs of job output data in the filesystem. We'd love to trim that down to more sensible sizes by pruning the older job information. Are there any guidelines on how to do that (and/or tools available for it)?
Our production instance is larger still; staging is about the same size as your instance. We've considered this problem a few times but we don't have clear answers right now. It's pending - we need input from a DBA on how best to optimise the current database models, implement an archive/purge method, and keep on top of the changes due in the refactoring.
Deleting bundles (and attachments) also means removing the corresponding files from the filesystem. A count of TestJobs is the most obvious metric for size, but a count of TestResults may be more relevant: it is the bundles and the dashboard testresult table which use up the most space.
$ sudo lava-server manage shell
Python 2.7.11 (default, Dec 9 2015, 00:29:25)
[GCC 5.3.1 20151205] on linux2
Type "help", "copyright", "credits" or "license" for more information.
(InteractiveConsole)
>>> from dashboard_app.models import TestResult
>>> TestResult.objects.count()
Or from postgres (where timing is available):
lava-staging=# select count(id) from dashboard_app_testresult;
  count
---------
 2625659
(1 row)
Time: 367.321 ms
With the pipeline, test results will be moving into the lava_results_app tables. Long term, this means the refactoring will have an impact here - once the code support for Bundle et al is removed, there will be database migrations to drop those tables. Just dealing with those migrations is going to be an issue for large instances.
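To give an idea of the shape of an archive/purge pass, something like the sketch below could be run from lava-server manage shell. It is untested and the model/field names (Bundle.uploaded_on and the content FileField) are from memory, so treat it as an illustration rather than a supported tool - the point being that the file has to be unlinked explicitly, because Django does not remove FileField content from disk when the row is deleted, while the .delete() itself cascades into the related test runs and results.

# untested sketch - model/field names assumed, not a supported tool
from datetime import timedelta
from django.utils import timezone
from dashboard_app.models import Bundle

cutoff = timezone.now() - timedelta(days=365)

# assumes Bundle has an uploaded_on timestamp and a content FileField
for bundle in Bundle.objects.filter(uploaded_on__lt=cutoff):
    if bundle.content:
        bundle.content.delete(save=False)  # remove the file from disk first
    bundle.delete()                        # cascades to related test runs/results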
On Mon, 4 Jan 2016 20:17:55 +0000 Neil Williams codehelp@debian.org wrote:
> On Mon, 04 Jan 2016 15:37:38 +0100 Sjoerd Simons sjoerd.simons@collabora.co.uk wrote:
>> Hey all,
>> Our lava instance has now run well over 120,000 jobs, which is great.
>> Unfortunately this means we've got quite a lot of historical data: the postgresql database is around 45 gigs, and there's well over 600 gigs of job output data in the filesystem. We'd love to trim that down to more sensible sizes by pruning the older job information. Are there any guidelines on how to do that (and/or tools available for it)?
> Our production instance is larger still; staging is about the same size as your instance. We've considered this problem a few times but we don't have clear answers right now. It's pending - we need input from a DBA on how best to optimise the current database models, implement an archive/purge method, and keep on top of the changes due in the refactoring.
> Deleting bundles (and attachments) also means removing the corresponding files from the filesystem. A count of TestJobs is the most obvious metric for size, but a count of TestResults may be more relevant: it is the bundles and the dashboard testresult table which use up the most space.
> $ sudo lava-server manage shell
> Python 2.7.11 (default, Dec 9 2015, 00:29:25)
> [GCC 5.3.1 20151205] on linux2
> Type "help", "copyright", "credits" or "license" for more information.
> (InteractiveConsole)
> >>> from dashboard_app.models import TestResult
> >>> TestResult.objects.count()
> Or from postgres (where timing is available):
> lava-staging=# select count(id) from dashboard_app_testresult;
>   count
> ---------
>  2625659
> (1 row)
> Time: 367.321 ms
Sjoerd: do you have any performance issues with a database of that size? A comparison of the commands above would be useful (along with information on the load on that machine, the amount of RAM, etc.).
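For the comparison, something along these lines should give the numbers - \timing in psql for the query, and free/uptime for the RAM and load figures (the lavaserver database name is just a guess at your setup):

$ free -h
$ uptime
$ sudo -u postgres psql lavaserver
lavaserver=# \timing
lavaserver=# select count(id) from dashboard_app_testresult;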
On Tue, 2016-01-05 at 16:11 +0000, Neil Williams wrote:
> On Mon, 4 Jan 2016 20:17:55 +0000 Neil Williams codehelp@debian.org wrote:
>> On Mon, 04 Jan 2016 15:37:38 +0100 Sjoerd Simons sjoerd.simons@collabora.co.uk wrote:
>>> Hey all,
>>> Our lava instance has now run well over 120,000 jobs, which is great.
>>> Unfortunately this means we've got quite a lot of historical data: the postgresql database is around 45 gigs, and there's well over 600 gigs of job output data in the filesystem. We'd love to trim that down to more sensible sizes by pruning the older job information. Are there any guidelines on how to do that (and/or tools available for it)?
>> Our production instance is larger still; staging is about the same size as your instance. We've considered this problem a few times but we don't have clear answers right now. It's pending - we need input from a DBA on how best to optimise the current database models, implement an archive/purge method, and keep on top of the changes due in the refactoring.
>> Deleting bundles (and attachments) also means removing the corresponding files from the filesystem. A count of TestJobs is the most obvious metric for size, but a count of TestResults may be more relevant: it is the bundles and the dashboard testresult table which use up the most space.
>> $ sudo lava-server manage shell
>> Python 2.7.11 (default, Dec 9 2015, 00:29:25)
>> [GCC 5.3.1 20151205] on linux2
>> Type "help", "copyright", "credits" or "license" for more information.
>> (InteractiveConsole)
>> >>> from dashboard_app.models import TestResult
>> >>> TestResult.objects.count()
>> Or from postgres (where timing is available):
>> lava-staging=# select count(id) from dashboard_app_testresult;
>>   count
>> ---------
>>  2625659
>> (1 row)
>> Time: 367.321 ms
On our instance:

lavaserver=> select count(id) from dashboard_app_testresult;
   count
-----------
 193416946
(1 row)
Time: 38213.638 ms
> Sjoerd: do you have any performance issues with a database of that size? A comparison of the commands above would be useful (along with information on the load on that machine, the amount of RAM, etc.).
It's a virtual machine with 3 CPUs and 4G of RAM. The host is a lightly loaded 12-core E5-2630L machine (24 threads/logical cores) with a hardware RAID array for storage, so I/O should be pretty decent.
The query you suggested doesn't perform too well, but overall we don't see runtime performance issues with lava at the moment. It just eats lots and lots of disk space :)
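In case it helps whoever picks this up: a quick way to see which tables are actually eating the space is the standard postgres query below (nothing LAVA-specific about it), run from the same psql session.

SELECT relname, pg_size_pretty(pg_total_relation_size(relid)) AS total_size
FROM pg_catalog.pg_statio_user_tables
ORDER BY pg_total_relation_size(relid) DESC
LIMIT 10;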