Error: Data received on socket was not in expected format

whollycow007 · October 23, 2015, 8:20pm

<RC=0x16[Data received on socket was not in the expected format], HTTP Request failed. Examine 'objextra' for full result, Results=1, C Source=(src/http.c,140), OBJ=ViewResult<RC=0x16[Data received on socket was not in the expected format], Value='},\n\n    ],\n    "status": "timeout",\n    "metrics": {\n        "elapsedTime": "1m15.289138508s",\n        "executionTime": "1m15.28911565s",\n        "resultCount": 59777,\n        "resultSize": 78879564\n    }\n}\n', HTTP=200>>

Could someone point me towards how to fix this?

The error gets thrown while iterating across the results of a N1QL query using the Python SDK. Offending line:

for row in cb.n1ql_query(query):

Here’s the traceback:

Traceback (most recent call last):
  File "couchbaseDataExportES.py", line 95, in <module>
    writeToES(q)
  File "couchbaseDataExportES.py", line 29, in writeToES
    for row in cb.n1ql_query(query):
  File "/usr/local/lib/python2.7/dist-packages/couchbase/n1ql.py", line 343, in __iter__
    raw_rows = self.raw.fetch(self._mres)
_ProtocolError_0x16 (generated, catch ProtocolError): <RC=0x16[Data received on socket was not in the expected format], HTTP Request failed. Examine "objextra" for full result, Results=1, C Source=(src/http.c,140), OBJ=ViewResult<RC=0x16[Data received on socket was not in the expected format], Value="},\n\n    ],\n    "status": "timeout",\n    "metrics": {\n        "elapsedTime": "1m15.540084496s",\n        "executionTime": "1m15.540053887s",\n        "resultCount": 60085,\n        "resultSize": 79286006\n    }\n}\n", HTTP=200>>

The error seems to occur randomly.

whollycow007 · October 23, 2015, 8:29pm

There is nothing wrong with the document that it’s trying to bring in. I am able to write a query to bring in the specific document. Moreover, it fails with this error randomly across docs (never consistently on the same doc).

manik · October 26, 2015, 8:39am

Can you please add some code to catch the exception, print the raw output, and paste it here?

Cheers,
Manik
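
A minimal sketch of what catching the exception and printing the raw output could look like. It assumes the raised exception carries the HTTP result in an `objextra` attribute (as the message "Examine 'objextra' for full result" suggests) and that this result exposes `value` and `http_status`. Both attribute names are assumptions here, so they are read with `getattr` and fall back to `None`. The helper names `describe_error` and `iterate_and_report` are hypothetical.

```python
def describe_error(exc):
    """Collect whatever raw details the exception exposes.

    Assumption: the SDK attaches the raw HTTP result as `exc.objextra`,
    with `value` (response body) and `http_status` attributes. Anything
    that is missing simply comes back as None.
    """
    extra = getattr(exc, "objextra", None)
    return {
        "type": type(exc).__name__,
        "message": str(exc),
        "raw_value": getattr(extra, "value", None),
        "http_status": getattr(extra, "http_status", None),
    }


def iterate_and_report(rows):
    """Iterate over `rows`; on failure print the raw details and re-raise."""
    count = 0
    try:
        for _ in rows:
            count += 1
    except Exception as exc:  # generic on purpose; the traceback names ProtocolError
        print(describe_error(exc))
        raise
    return count
```

With the SDK in place this would be called as `iterate_and_report(cb.n1ql_query(query))`.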

whollycow007 · October 27, 2015, 6:16pm

@manik, I would love to, but Couchbase doesn’t yet have proper documentation on catching error codes for N1QL queries using the Python API.

can you please send me a link to the docs when they are complete and then i can trap the error for you.

right now, we just trap a generic error and reprocess. it works.
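
The "trap a generic error and reprocess" approach could be sketched roughly as follows. This is an illustration, not the poster's actual code: `fetch_all_with_retry` and `run_query` are hypothetical names, and `run_query` is assumed to be a zero-argument callable that returns a fresh result iterable on every call.

```python
def fetch_all_with_retry(run_query, max_attempts=3):
    """Call run_query() and collect its rows, starting over on any error.

    `run_query` is a hypothetical zero-argument callable that returns a
    fresh iterable each time, e.g. lambda: cb.n1ql_query(query).
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = None
    for _ in range(max_attempts):
        rows = []  # discard partial rows from a failed attempt
        try:
            for row in run_query():
                rows.append(row)
            return rows
        except Exception as exc:  # generic on purpose, as described above
            last_error = exc
    raise last_error
```

Rows are buffered in memory so that a failed attempt does not leave half-processed results behind. For a result set the size of the one in the error message (about 79 MB), that trade-off may or may not be acceptable.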

mnunberg · October 27, 2015, 9:31pm

The server is returning invalid JSON, as seen in the exception. Specifically:

},] is invalid JSON, as there is a trailing comma after the last item in the list.
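
This is easy to reproduce with Python's standard `json` module. The payloads below are shortened stand-ins for the response tail shown in the exception, not the actual server output, and `is_valid_json` is a hypothetical helper.

```python
import json


def is_valid_json(text):
    """Return True if `text` parses as JSON, False otherwise."""
    try:
        json.loads(text)
    except ValueError:  # in Python 3, json.JSONDecodeError subclasses ValueError
        return False
    return True


# The results array ends with a comma before the closing bracket.
with_trailing_comma = '{"results": [{"id": 1},\n\n    ],\n    "status": "timeout"}'
# Same payload without the trailing comma.
without_trailing_comma = '{"results": [{"id": 1}\n\n    ],\n    "status": "timeout"}'

print(is_valid_json(with_trailing_comma))
print(is_valid_json(without_trailing_comma))
```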

mnunberg · October 27, 2015, 9:35pm

Issue filed as https://issues.couchbase.com/browse/MB-16659

manik · October 28, 2015, 4:59am

Aah, I see how that would have happened. The query timed out midway through the request processing. It should have inserted an empty result-set after that comma, before the closing ‘]’ and the time-out error. @mnunberg, it looks like the default timeout for the Python client is 75 seconds. As a workaround, is it possible to increase that? That way the timeout may not occur.

cheers,
Manik

mnunberg · October 28, 2015, 5:34pm

The timeout can be set on a global basis by using the n1ql_timeout option in the connection string.

The timeout can be set on a per-query basis as well, but it must be lower than or equal to the timeout specified globally (by default 75 seconds unless modified in the connection string).

Libcouchbase will also honor the timeout property in the N1QLQuery object itself, like so:

q.set_option("timeout", "100s")

for example.

I’ll see about adding a proper Pythonic way to modify the n1ql timeout in-situ (i.e. assignable after Bucket creation).
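
Putting the two settings together might look like the sketch below. The host `localhost`, the bucket `default`, and the value `120` are placeholders, and the value is assumed to be interpreted in seconds. The helper names are hypothetical, and the SDK imports are done lazily inside the functions so the connection-string helper can be tried without the SDK installed.

```python
try:
    from urllib.parse import urlencode  # Python 3
except ImportError:
    from urllib import urlencode  # Python 2


def build_connstr(host, bucket, n1ql_timeout):
    """Build a connection string carrying the global n1ql_timeout option.

    Assumption: the value is given in seconds.
    """
    query = urlencode({"n1ql_timeout": n1ql_timeout})
    return "couchbase://{0}/{1}?{2}".format(host, bucket, query)


def open_bucket(connstr):
    """Open a bucket with the given connection string (needs the SDK)."""
    from couchbase.bucket import Bucket
    return Bucket(connstr)


def make_query(statement, timeout="100s"):
    """Create a query with a per-query timeout, as in the post above.

    The per-query value has to stay at or below the global timeout.
    """
    from couchbase.n1ql import N1QLQuery
    q = N1QLQuery(statement)
    q.set_option("timeout", timeout)
    return q
```

For example, `cb = open_bucket(build_connstr("localhost", "default", 120))` followed by `cb.n1ql_query(make_query(statement))` would run with a 120-second global limit and a 100-second limit on that query.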

whollycow007 · October 28, 2015, 11:49pm

Thank you.

I set the n1ql_timeout to a higher value and that seems to have resolved this issue.

Though we now sometimes see this:

  File "/usr/local/lib/python2.7/dist-packages/couchbase/n1ql.py", line 298, in _handle_meta
    raise N1QLError.pyexc("N1QL Execution failed", err)
N1QLError: <N1QL Execution failed, OBJ={u"msg": u"Index scan timed out - cause: Index scan timed out", u"code": 12015}>

Any suggestions?

whollycow007 · October 29, 2015, 12:48am

@mnunberg @manik

my new issue seems to be a duplicate of: Loading... and Loading... - maybe even related to Loading...

they all have closed/resolved status. do you know what the resolution is?

manik · October 29, 2015, 6:57pm

Hi @whollycow007,

With secondary indexes, an index scan cannot exceed 2 minutes. This is a design limitation. If you expect to be scanning a large dataset, then you should consider one of the following workarounds:

Create an index that is specific to the query in question, so that it is more selective (contains fewer entries) than the original index. For example, if the query contains a WHERE clause, create the index with that WHERE clause.
Use a view index.

Cheers,
Manik
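
As a rough illustration of the first workaround, the helper below assembles a CREATE INDEX statement that carries the query's WHERE clause. The function name, the index name `ordersByDate`, the key `orderDate`, and the predicate are all hypothetical, and the string is built without any escaping, so this is for illustration only.

```python
def partial_index_statement(name, bucket, keys, where):
    """Build a CREATE INDEX statement restricted by a WHERE clause.

    All arguments are inserted verbatim; no quoting or escaping is done.
    """
    return "CREATE INDEX {0} ON {1}({2}) WHERE {3} USING GSI".format(
        name, bucket, ", ".join(keys), where
    )


# Hypothetical example: index only the documents a given query filters on.
statement = partial_index_statement(
    "ordersByDate", "default", ["orderDate"], 'type = "order"'
)
print(statement)
```

The resulting statement can then be run the same way as any other N1QL statement.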

whollycow007 · October 30, 2015, 2:07am

@manik

This is a test server with about 350,000 docs - mostly small docs. Total volume is 255 MB. 99% of the docs have a ‘type’ key, and the ones that don’t are atomic counters.

This is my index definition:

CREATE INDEX docType ON default(type,insertEpoch,updateEpoch) USING GSI

And this is my query:

select distinct type from default where type is not null

A describe command shows the index docType is being used.

I’ll try a view index later tonight - could you briefly tell me the diff between a GSI and View? My assumption would be that views are auto refreshed every 5 secs whereas a GSI is updated based on data changes (traditional index)? I’m possibly completely wrong since I see spikes in IOPS after putting in the GSI too.

Help!

manik · November 2, 2015, 6:37am

Hi @whollycow007

Here is a document that details the difference between views and GSI:

http://developer.couchbase.com/documentation/server/4.0/architecture/gsi-versus-views.html

Cheers,
Manik

geraldss · November 2, 2015, 3:13pm

One small addition to this thread. We are also providing covering indexes beginning with 4.1.0 developer preview. These indexes avoid the key-value fetch after scanning the index, and should produce better performance for your use case.
