ISTEX data sample gives a parsing error


(Aymericvie) #1

Error at parsing. Please create a new corpus. If this error persists then report the bug with this id: 8082949.

Hi,
I obtain the above when sampling a text query in ISTEX. Is there a work around?
Query was “genetic algorithm” or “genetic+algorithm”.
Creating a new corpus does not solve the issue.
I tried to process samples with .zip files, it’s running since yesterday but not completed yet.

Thanks a lot in advance.
Best,
Aymeric


(Aymericvie) #2

Other error message:

Error in corpus parsing at step 0 (Docs)[“Traceback (most recent call last):”,“File “/srv/env_3-5/lib/python3.5/site-packages/celery/app/trace.py”, line 240, in trace_task\n R = retval = fun(*args, **kwargs)”,“File “/srv/env_3-5/lib/python3.5/site-packages/celery/app/trace.py”, line 438, in protected_call\n return self.run(*args, **kwargs)”,“File “/srv/gargantext/gargantext/util/toolchain/main.py”, line 69, in parse_extract_indexhyperdata\n parse(corpus)”,“File “/srv/gargantext/gargantext/util/toolchain/parsing.py”, line 245, in parse\n raise error”,“File “/srv/gargantext/gargantext/util/toolchain/parsing.py”, line 128, in parse\n for hyperdata in parserbot(resource[“path”]):”,“File “/srv/gargantext/gargantext/util/parsers/_Parser.py”, line 167, in iter\n yield from self.iter(f)”,“File “/srv/gargantext/gargantext/util/parsers/_Parser.py”, line 174, in iter\n for hyperdata in self.parse(file):”,“File “/srv/gargantext/gargantext/util/parsers/ISTEX.py”, line 12, in parse\n json_docs = data[“hits”]”,“KeyError: ‘hits’”]NaN(bug report here)


(David Chavalarias) #3

Hi Aymeric,
This should be due to a back-end disconnection of Gargantext V3. This is a known issue that will be fixed in the forthcoming V4. We regularly restart the V3 for this purpose. You can retry now, it should be ok. Best.


(Aymericvie) #4

Thanks a lot!

Le mer. 28 oct. 2020 à 09:56, David Chavalarias noreply@iscpif.fr a écrit :