Killian Janod
2018-11-09 15:04:30 UTC
Dear ArangoDB community,
I need a little help =) I'll try to make this as simple as possible. I
have a fairly small database (25 GB on disk) with two main collections
(let's call them A and B). A contains 31,025,557 documents and B contains
20,785,230. There are many indexes on both (with high selectivity). The data
in A and B are semantically linked: they describe the same events.
ArangoDB is version 3.3.19 with the RocksDB engine and the default
configuration (installed with Docker).
I'm trying to find every document b in B that has at least one counterpart
a in A, and then return b together with its most recent counterpart a.
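To make the expected result concrete, here is a minimal Python sketch of the semantics I'm after (it is not the AQL itself). The attribute names `event_id` and `timestamp`, and the helper `latest_counterparts`, are hypothetical placeholders for this illustration, not the real attribute names.

```python
from typing import Dict, Iterable, Iterator, Tuple

Doc = Dict[str, object]


def latest_counterparts(a_docs: Iterable[Doc], b_docs: Iterable[Doc]) -> Iterator[Tuple[Doc, Doc]]:
    """Yield (b, most recent a) for every b that has at least one counterpart in A.

    Assumption: documents are linked by a shared `event_id` and ordered by `timestamp`.
    """
    # Keep only the most recent A document per link key.
    latest: Dict[object, Doc] = {}
    for a in a_docs:
        key = a["event_id"]
        if key not in latest or a["timestamp"] > latest[key]["timestamp"]:
            latest[key] = a

    # Stream over B and emit only the documents that have a counterpart.
    for b in b_docs:
        a = latest.get(b["event_id"])
        if a is not None:
            yield b, a


# Tiny made-up sample data.
A = [
    {"_key": "a1", "event_id": "e1", "timestamp": 10},
    {"_key": "a2", "event_id": "e1", "timestamp": 20},
    {"_key": "a3", "event_id": "e3", "timestamp": 5},
]
B = [
    {"_key": "b1", "event_id": "e1"},
    {"_key": "b2", "event_id": "e2"},
]

pairs = [(b["_key"], a["_key"]) for b, a in latest_counterparts(A, B)]
print(pairs)  # prints [('b1', 'a2')]
```

Here b1 is returned with a2 (its most recent counterpart), and b2 is dropped because nothing in A matches it. This only pins down the result I expect; it says nothing about how the queries should be written.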
My problem is that I wrote two AQL queries that do the job but use a
huge amount of RAM (more than 60 GB).
Both queries can be found here:
https://gist.github.com/Waateur/6c8c0b2c40d0dfa08cecfe780275cd9f
The first one uses a join and the second one uses a graph.
I'm pretty sure I'm missing something logical. There must be something wrong
with my queries.
Can someone help me reduce this memory consumption? (I know time and memory
are a trade-off; spending more time is okay in my workflow.)
Any advice or criticism is welcome.
Sincerely,
Killian Janod
--
You received this message because you are subscribed to the Google Groups "ArangoDB" group.
To unsubscribe from this group and stop receiving emails from it, send an email to arangodb+***@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.