Error bulk loading

Stardog 5.0-beta

I'm getting the following exception in the logs trying to load yago. I've seen several of them in the logs. It's only a warning and doesn't terminate the bulk loading but it looked like something you might be interested in seeing.

WARN  2017-05-28 16:21:43,940 [Stardog.Executor-32] com.complexible.common.rdf.rio.RDFStreamProcessor:setException(591): Parse process creator: 
java.lang.Exception: null
	at com.complexible.common.rdf.rio.RDFStreamProcessor.<init>(RDFStreamProcessor.java:129) ~[stardog-utils-rdf-5.0-beta.jar:?]
	at com.complexible.common.rdf.rio.RDFStreamProcessor.create(RDFStreamProcessor.java:105) ~[stardog-utils-rdf-5.0-beta.jar:?]
	at com.complexible.stardog.index.StardogIndexLoader.loader(StardogIndexLoader.java:76) ~[stardog-5.0-beta.jar:?]
	at com.complexible.stardog.StardogKernel.bulkLoad(StardogKernel.java:1077) ~[stardog-5.0-beta.jar:?]
	at com.complexible.stardog.StardogKernel.createDatabase(StardogKernel.java:1233) ~[stardog-5.0-beta.jar:?]
	at com.complexible.stardog.protocols.http.server.AdminDatabaseService.createNewDatabase(AdminDatabaseService.java:432) ~[stardog-protocols-http-server-5.0-beta.jar:?]
	at com.stardog.http.server.undertow.jaxrs.ExtractRoutes.lambda$handleIt$81(ExtractRoutes.java:183) ~[stardog-protocols-http-server-5.0-beta.jar:?]
	at com.stardog.http.server.undertow.jaxrs.ExtractRoutes$$Lambda$196/22363198.run(Unknown Source) ~[?:?]
	at org.apache.shiro.subject.support.SubjectRunnable.doRun(SubjectRunnable.java:120) ~[shiro-core-1.2.3.jar:1.2.3]
	at org.apache.shiro.subject.support.SubjectRunnable.run(SubjectRunnable.java:108) ~[shiro-core-1.2.3.jar:1.2.3]
	at com.stardog.http.server.undertow.ErrorHandling.lambda$safeDispatch$43(ErrorHandling.java:70) ~[stardog-protocols-http-server-5.0-beta.jar:?]
	at com.stardog.http.server.undertow.ErrorHandling$$Lambda$197/1250743265.run(Unknown Source) ~[?:?]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_05]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_05]
	at java.lang.Thread.run(Thread.java:745) [?:1.8.0_05]

....and another exception further on while computing statistics

INFO 2017-05-29 02:27:09,326 [XNIO-1 task-14] com.complexible.stardog.StardogKernel:stop(329): Computing statistics finished in 00:05:46.931
ERROR 2017-05-29 02:27:19,617 [XNIO-1 task-14] com.stardog.http.server.undertow.ErrorHandling:writeError(179): Unexpected error on the server
java.lang.StackOverflowError: null
at java.util.TreeMap.keySet(TreeMap.java:817) ~[?:1.8.0_05]
at com.complexible.common.trie.TrieStructurePersister.persist(TrieStructurePersister.java:44) ~[stardog-utils-common-5.0-beta.jar:?]
at com.complexible.common.trie.TrieStructurePersister.persist(TrieStructurePersister.java:46) ~[stardog-utils-common-5.0-beta.jar:?]

The last line is repeated a couple hundred times.

Hi Zach,

Could you point to the data file which we could use to reproduce this?

Thanks,
Pavel

http://resources.mpi-inf.mpg.de/yago-naga/yago/download/yago/yago3_entire_ttl.7z

Thanks. There’s some ongoing work on improving statistics computation on highly diverse datasets, e.g. yago or Wikidata, and I’ll take a look at this issue in that context before 5.0-final.

Cheers,
Pavel