I’ve split all my data into graphs, one main graph with 30 million triples, one for the ontology and one for our user data.
I’ve enabled ICV on the ontology graph and on the user data graph.
Insert queries take around 20 seconds for creating a user in the user data graph
When I drop the main graph (the one with 30 million triples) then inserting a user takes 500 ms
Is this performance abnormal? Can I somehow debug what the ICV engine is doing to find out why the performance drops when I add data to a non-ICV graph?
On top of the insert query example, can I also get the ICV configuration you’re using? This includes your stardog.properties file and the output of stardog-admin metadata get myDb.
The data in newGraph is real data from the client. It's 1/17th the amount of data we expect in production. And we also expect a substantial growth in the years to come.
I've tried adding just about anything to the newGraph and it still slows down the ICV validation. I downloaded and added this file http://rdf.geospecies.org/geospecies.rdf.gz and the insert query goes from 500ms to 600-800 ms.
I seem unable to reproduce your issue in Stardog 4.2.4, though granted the data I’m using probably isn’t hitting ICV the same way yours is. I inserted a user into the <http://data.einnsyn.no/brukermeta/> graph on the testSesame db as created by your pasted db create command. I then loaded the geospecies.rdf.gz data multiple times into separate graphs. I loaded it once, then 5 more times, then 15 more times, and after each load I was able to insert one more user into the <http://data.einnsyn.no/brukermeta/> graph in about the same amount of time.
Are you on Stardog 4.2.4? If you try setting reasoning.consistency.automatic or icv.reasoning.enabled to false, do you see any difference?
Turning off automatic consistency doesn’t seem to do anything with the performance on this little data, but it does make it faster when I have 10x the amount.
Turning off icv reasoning though doesn’t work, cause there is data that is dependent on reasoning being enabled.
I know there is some data that you don’t have, I’ll just simply make a dump of the database for you.