ISTEX data sample gives a parsing error

(Aymericvie) #1

Error at parsing. Please create a new corpus. If this error persists then report the bug with this id: 8082949.

I get the error above when sampling a text query in ISTEX. Is there a workaround?
The query was “genetic algorithm” or “genetic+algorithm”.
Creating a new corpus does not solve the issue.
I also tried processing samples from .zip files; that job has been running since yesterday and has not completed yet.

Thanks a lot in advance.

(Aymericvie) #2

Other error message:

Error in corpus parsing at step 0 (Docs)

Traceback (most recent call last):
  File "/srv/env_3-5/lib/python3.5/site-packages/celery/app/", line 240, in trace_task
    R = retval = fun(*args, **kwargs)
  File "/srv/env_3-5/lib/python3.5/site-packages/celery/app/", line 438, in protected_call
    return*args, **kwargs)
  File "/srv/gargantext/gargantext/util/toolchain/", line 69, in parse_extract_indexhyperdata
    parse(corpus)
  File "/srv/gargantext/gargantext/util/toolchain/", line 245, in parse
    raise error
  File "/srv/gargantext/gargantext/util/toolchain/", line 128, in parse
    for hyperdata in parserbot(resource["path"]):
  File "/srv/gargantext/gargantext/util/parsers/", line 167, in iter
    yield from self.iter(f)
  File "/srv/gargantext/gargantext/util/parsers/", line 174, in iter
    for hyperdata in self.parse(file):
  File "/srv/gargantext/gargantext/util/parsers/", line 12, in parse
    json_docs = data["hits"]
KeyError: 'hits'

(David Chavalarias) #3

Hi Aymeric,
This is most likely due to a back-end disconnection in Gargantext V3. It is a known issue that will be fixed in the forthcoming V4. In the meantime, we restart V3 regularly for this reason. You can retry now; it should be OK. Best.
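
For readers who hit the same traceback: it ends at `json_docs = data["hits"]` with `KeyError: 'hits'`, which suggests the parser received a JSON payload without the `hits` key it expects (for example an error or empty response instead of search results). The following is only a minimal sketch of a more defensive check, not the actual Gargantext parser; the function name `extract_hits` and the sample payloads are hypothetical.

```python
import json


def extract_hits(raw):
    """Return the value stored under "hits", or raise a descriptive error.

    Hypothetical helper: it only illustrates checking for the key
    before indexing, instead of letting a bare KeyError propagate.
    """
    data = json.loads(raw)
    if not isinstance(data, dict) or "hits" not in data:
        # Report what was actually received to make the failure diagnosable.
        keys = sorted(data) if isinstance(data, dict) else type(data).__name__
        raise ValueError("response has no 'hits' key; got: {}".format(keys))
    return data["hits"]


# Assumed payload shapes, for illustration only.
ok_payload = '{"hits": [{"title": "A genetic algorithm study"}]}'
bad_payload = '{"error": "backend unavailable"}'

docs = extract_hits(ok_payload)

try:
    extract_hits(bad_payload)
except ValueError as exc:
    message = str(exc)
```

With a check like this, a missing `hits` key would surface as a message naming the keys that were actually returned, which makes it easier to tell a back-end problem from a malformed query.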

(Aymericvie) #4

Thanks a lot!

On Wed, 28 Oct 2020 at 09:56, David Chavalarias wrote: