Overflow error string longer than 2147483647 bytes

problem Sketch When using python requests to upload files, theOverflow error: String longer than 2147483647 bytes error detail Problem code data = {} with open("bigfile", "rb") as f: r = requests.post(PUBLISH_URL, data=data, files={"xxx": f}) traceback Traceback (most recent call last): File "tUTF-8...

problem

Sketch

When using python requests to upload files, the
Overflow error: String longer than 2147483647 bytes error

detail

Problem code

    data = {}
    with open("bigfile", "rb") as f:
        r = requests.post(PUBLISH_URL, data=data, files={"xxx": f})

traceback

Traceback (most recent call last):
  File "test.py", line 52, in <module>
    main()
  File "test.py", line 49, in main
    publish()
  File "test.py", line 41, in publish
    r = requests.post(PUBLISH_URL, data=cfg, files={file_key: ("./test.apk", f)})
  File "/usr/local/lib/python2.7/dist-packages/requests/api.py", line 116, in post
    return request('post', url, data=data, json=json, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/requests/api.py", line 60, in request
    return session.request(method=method, url=url, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/requests/sessions.py", line 533, in request
    resp = self.send(prep, **send_kwargs)
  File "/usr/local/lib/python2.7/dist-packages/requests/sessions.py", line 646, in send
    r = adapter.send(request, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/requests/adapters.py", line 449, in send
    timeout=timeout
  File "/usr/local/lib/python2.7/dist-packages/urllib3/connectionpool.py", line 600, in urlopen
    chunked=chunked)
  File "/usr/local/lib/python2.7/dist-packages/urllib3/connectionpool.py", line 354, in _make_request
    conn.request(method, url, **httplib_request_kw)
  File "/usr/lib/python2.7/httplib.py", line 1057, in request
    self._send_request(method, url, body, headers)
  File "/usr/lib/python2.7/httplib.py", line 1097, in _send_request
    self.endheaders(body)
  File "/usr/lib/python2.7/httplib.py", line 1053, in endheaders
    self._send_output(message_body)
  File "/usr/lib/python2.7/httplib.py", line 897, in _send_output
    self.send(msg)
  File "/usr/lib/python2.7/httplib.py", line 873, in send
    self.sock.sendall(data)
  File "/usr/lib/python2.7/ssl.py", line 743, in sendall
    v = self.send(data[count:])
  File "/usr/lib/python2.7/ssl.py", line 709, in send
    v = self._sslobj.write(data)
OverflowError: string longer than 2147483647 bytes

Analysis process

requests read all file obj into memory

I didn’t expect that the implementation of requests is so crude. file.read(), in fact, is the same. When sending large files, the memory is rising rapidly. The code is as follows:
requests/models.py

    @staticmethod
    def _encode_files(files, data):
        """Build the body for a multipart/form-data request.

        Will successfully encode files when passed as a dict or a list of
        tuples. Order is retained if data is a list of tuples but arbitrary
        if parameters are supplied as a dict.
        The tuples may be 2-tuples (filename, fileobj), 3-tuples (filename, fileobj, contentype)
        or 4-tuples (filename, fileobj, contentype, custom_headers).
        """
        if (not files):
            raise ValueError("Files must be provided.")
        elif isinstance(data, basestring):
            raise ValueError("Data must not be a string.")

        new_fields = []
        fields = to_key_val_list(data or {})
        files = to_key_val_list(files or {})

        for field, val in fields:
            if isinstance(val, basestring) or not hasattr(val, '__iter__'):
                val = [val]
            for v in val:
                if v is not None:
                    # Don't call str() on bytestrings: in Py3 it all goes wrong.
                    if not isinstance(v, bytes):
                        v = str(v)

                    new_fields.append(
                        (field.decode('utf-8') if isinstance(field, bytes) else field,
                         v.encode('utf-8') if isinstance(v, str) else v))

        for (k, v) in files:
            # support for explicit filename
            ft = None
            fh = None
            if isinstance(v, (tuple, list)):
                if len(v) == 2:
                    fn, fp = v
                elif len(v) == 3:
                    fn, fp, ft = v
                else:
                    fn, fp, ft, fh = v
            else:
                fn = guess_filename(v) or k
                fp = v

            if isinstance(fp, (str, bytes, bytearray)):
                fdata = fp
            elif hasattr(fp, 'read'):
                fdata = fp.read() # Here all files are read into memory
            elif fp is None:
                continue
            else:
                fdata = fp

            rf = RequestField(name=k, data=fdata, filename=fn, headers=fh)
            rf.make_multipart(content_type=ft)
            new_fields.append(rf)

        body, content_type = encode_multipart_formdata(new_fields)

        return body, content_type

Official documents recommend using requests toolbar

https://2.python-requests.org…

In the event you are posting a very large file as a multipart/form-data request, you may want to stream the request. By default, requests does not support this, but there is a separate package which does — requests-toolbelt. You should read the toolbelt’s documentation for more details about how to use it.

Using requests toolbar

from requests_toolbelt import MultipartEncoder

    data = {}
    with open("bigfile", "rb") as f:
        data["xxx"] = ("filename", f)
        m = MultipartEncoder(fields=data)
        r = requests.post(PUBLISH_URL, data=m, headers={'Content-Type': m.content_type})

summary

The implementation of sending files for requests is very rough. It will read all the contents of the files directly to memory and sign again. If SSL sign is larger than 2GB, an error will be reported. In official documents, it is recommended to upload large files using requests toolkit
Block upload is also a solution (if supported by the server)

This limitation is in httplib. You can easily avoid it by slightly changing your code:

datafile = 'someHugeFile'
with open(datafile, 'rb') as myfile:
    resp = requests.put(url, data=myfile, verify=False)

@Lukasa that’s inaccurate. The traceback comes from an SSL wrapped socket. This has nothing to do with httplib from what I can see. OverflowErrors are raised when a value is larger than the underlying C integer size allowed. This can be seen if you call len(something_larger_than_four_gb) on a 32 bit system.

Unfortunately it looks like it cannot be avoided when you do a POST request with several headers. Then the file (or the files) is always read completely.

It would be great when this could be avoided in requests since I often have to send files which are longer than the available main memory on the system.

That’s true, but I don’t get to decide on the protocol and endpoint.

Doing the request with curl works fine, and as a workaround, I’m currently printing a curl command to STDOUT so that the user can launch it.

@eriktews can you share how you’re doing the upload? There are ways to stream uploads (like Lukasa’s comment shows). Is there a reason you cannot do that (if you are not already trying that)? Also, can you provide your actual traceback?

So the comment from Lukasa seems to work when you are uploading a single file, then you can do a streaming upload. But I have to do a normal post request with several variables in the data part and the file as a part of a multipart upload.

There is an API documentation at https://canvas.instructure.com/doc/api/file.file_uploads.html which shows a curl command in the «Step 2» section. Basically I wanna replicate that call with the requests package in streaming mode.

I don’t have the traceback at the moment, but when I get it, I will post it here.

Have you tried using the MultipartEncoder from the requests-toolbelt? That allows you to stream multipart/form-data uploads like doing an upload of a single file. It was written specifically for this use case and was added to our docs.

No, but that looks like right what I need. I will give it a try and see whether this works. I didn’t know about the toolbelt package at all, so maybe you should reference it in the normal requests package documentation.

so maybe you should reference it in the normal requests package documentation.

It is :)

@Lukasa ‘s method does not work — even with httplib off the signing still happens for the transport itself. In my case I have a 2GB+ POST request (not a file, just POST data). This is for an elasticsearch bulk update. The endpoint only has HTTPS so I’m working through other solutions now.

Traceback (most recent call last):
  File "/var/virtualenvs/centr/lib/python3.5/site-packages/elasticsearch/connection/http_requests.py", line 75, in perform_request
    timeout=timeout or self.timeout, verify=False)
  File "/var/virtualenvs/centr/lib/python3.5/site-packages/requests/sessions.py", line 609, in send
    r = adapter.send(request, **kwargs)
  File "/var/virtualenvs/centr/lib/python3.5/site-packages/requests/adapters.py", line 423, in send
    timeout=timeout
  File "/var/virtualenvs/centr/lib/python3.5/site-packages/requests/packages/urllib3/connectionpool.py", line 600, in urlopen
    chunked=chunked)
  File "/var/virtualenvs/centr/lib/python3.5/site-packages/requests/packages/urllib3/connectionpool.py", line 356, in _make_request
    conn.request(method, url, **httplib_request_kw)
  File "/usr/lib/python3.5/http/client.py", line 1106, in request
    self._send_request(method, url, body, headers)
  File "/usr/lib/python3.5/http/client.py", line 1151, in _send_request
    self.endheaders(body)
  File "/usr/lib/python3.5/http/client.py", line 1102, in endheaders
    self._send_output(message_body)
  File "/usr/lib/python3.5/http/client.py", line 936, in _send_output
    self.send(message_body)
  File "/usr/lib/python3.5/http/client.py", line 908, in send
    self.sock.sendall(data)
  File "/usr/lib/python3.5/ssl.py", line 891, in sendall
    v = self.send(data[count:])
  File "/usr/lib/python3.5/ssl.py", line 861, in send
    return self._sslobj.write(data)
  File "/usr/lib/python3.5/ssl.py", line 586, in write
    return self._sslobj.write(data)
OverflowError: string longer than 2147483647 bytes

Sorry, can you demonstrate your code please?

This should throw the error:

datafile = 'someHugeFile'
with open(datafile, 'rb') as myfile:
    r = requests.post(endpoint, data={'key': myfile.read()}, verify=False)

If endpoint is https then ssl will have to process the payload. I wonder if requests or requests-toolbelt could have an option to do the signature with some other library that doesn’t die when signing a 2GB string. Of course, I would say that people shouldn’t be signing such large things but it’s definitely a real crash that’s happening in the real world.

@adamn That was not my proposed solution. My proposed solution was to not read the file in manually at all. You are bumping into the same error as before, which is that we are sending a single gigantic string to httplib.

This is a behaviour we can fix: if we spot someone uploading a gigantic single string via Python then we can resolve it. But at this point I strongly recommend you use an intermediary file object: either one on disk, or by doing the urlencoding yourself and wrapping the result in a BytesIO.

I’ve already come up with a workaround so won’t be able to dig deeper into this unfortunately. I still suspect that the SSL payload needs to be signed/encrypted so the same thing will happen regardless of whether there is a file object or not since the exception is raised by ssl.write itself and I presume that method needs the entire payload. Chunking the POST seems like the only real option. Anyway, thanks for the help.

@adamn No, that’s not necessary. TLS uses stream encryption, it does not need the entire payload at once.

What you’re missing is that when given a file object, requests will automatically stream it in smaller chunks (specifically, 8192 kb chunks). Those cause no problem.

Sorry to comment on an old issue, but this looks _similar_ to an issue we’ve run into and I’m trying to decide whether it’s worth opening a new issue for it.

Again, requests.put where data is a _huge_ string doesn’t work, but we don’t get an error. requests just hangs sending; a packet capture shows that no more data is being sent.

This behaviour is _worse_ that an exception being raised.

Has the remote peer been ACKing at the TCP level? Is it still reading from the receive buffer? Has it TCP FIN’d?

Yes, the remote end is sending ACKs appropriately, no FIN or anything like that.

In fact, if you have a large file f, we see the problem if you do requests.put(url, data=f.read()) but not if you do requests.put(url, data=f). Obviously if we have a file handle, we wouldn’t bother to call read on it, but the point is that both calls are supposed generate the same request, and a packet capture shows that they do up until the point at which one stops sending packets.

Hrm. Is it possible for you to put together a small repro scenario? Do you see the same effect with other hosts?

As luck would have it, I have already done so.

Github doesn’t seem to want to let me attach files, so:

#!/usr/bin/env python

import requests


MB = 1024 ** 2
GB = MB * 1024


if __name__ == '__main__':
    data = 'x' * 4 * GB
    resp = requests.put('http://localhost:8000', data=data)
    print resp

And a server to run it against:

#!/usr/bin/env python

import BaseHTTPServer
import logging


READ_CHUNK = 1024 ** 2


class Handler(BaseHTTPServer.BaseHTTPRequestHandler):
    def do_PUT(self):
        logging.info("PUT request recieved")
        for header, value in self.headers.items():
            logging.info("header: %s = %s", header, value)

        length = int(self.headers['Content-Length'])

        logging.info("Content length %s, getting content...", length)

        while length:
            to_read = min(length, READ_CHUNK)
            logging.info("reading %s bytes...", to_read)
            self.rfile.read(to_read)
            length -= to_read

        logging.info("Recieved content")

        self.send_response(200)


def run(server_class=BaseHTTPServer.HTTPServer):
    server_address = ('', 8000)
    httpd = server_class(server_address, Handler)
    httpd.serve_forever()


if __name__ == '__main__':
    logging.basicConfig(
        level=logging.DEBUG,
        format="%(asctime)s %(levelname)s: %(message)s",
    )
    logging.debug("Starting server")
    run()

Obviously this isn’t the server we were running against when we first encountered this problem :)

Well as a first logical note I should point out that this is necessarily not the same problem as originally reported on this issue, as the original report affected TLS only, as discussed above. :wink: Regardless, let’s dig into this a bit.

Ah, sorry. Is it worth me opening a new issue then, or should I just leave it here, since you’re already looking at it?

Let’s leave it here for now. =)

Huh. That behaves…very oddly. On my machine, over the loopback, I don’t see any data sent at all: it’s like Requests just gave up on sending it. Further debugging seems to show this is happening at the level of socket.sendall, which for some absurd reason is just not sending the complete response. By «not sending the complete response» I mean socket.sendall is returning early, but demonstrably is not sending all the data.

Naturally, the reason this happens is the same as the reason Python does lots of other stupid crap: socket.sendall is written in C. The very first thing that socket.sendall does is get the length of the data that was sent into it and shoves it into a C int. Now, this is wrong to begin with: Py_buffer.len is a Py_ssize_t, and sizeof(ssize_t) is frequently larger than sizeof(int). So that’s bonkers stupid, and probably the source of this bug.

In fact, it definitely is, since the current Python master has a changed sendall that uses the correct size. This seems to have been cleaned up around Python 3 time as a general «64-bit issue» (see python/[email protected]) in the socket module.

That makes this ultimately a duplicate of CPython issue #18100. This has been open a long time in need of patch review, and given that Python 2.7 is now only getting security fixes I doubt the CPython developers will fix it at this point.

This is a difficult issue for Requests to sensibly police. We can tell when people will definitely hit it (e.g. because the input is a string which a length greater than 2GB), but there are many situations where people will hit it but we can’t tell (e.g. because the string plus the headers is greater than 2GB in size, or because there is a different type in use that CPython will treat as «stringish» that is larger than 2GB). So my initial inclination is, given that this is an issue that can be solved by moving to a newer version of Python, and that it can be worked around by not reading gigantic strings into memory (which is a best-practice anyway), and that if we ever move off httplib we’ll fix it automatically anyway, I’m inclined to suggest that we probably don’t have a huge pressure to resolve the issue? For my part, I think this is getting pretty close to «Dr, it hurts when I do this.» «So don’t do that then!» territory.

However, I’m willing to be disagreed with here.

The workaround is really simple — just wrap it in StringIO/BytesIO, but when you run into it, it’s difficult to diagnose, so any help that requests could give, even if it’s just a warning in the documentation, would be appreciated.

I can get behind the idea of a PR that adds a note in the documentation.

Running into the same problem,I dont this thread has an accepted solution,anyone has a solution for this ?

This also results in an OverflowError in self._sslobj.write(data):

files = {'file': open(tar_file_path, 'rb')}
headers = {'key': 'abc123'}
r = requests.post(url, files=files, headers=headers)

The file is 3GB in size.

@SpoonMeiser Wrapping the file contents in a BytesIO does not fix the problem on Python 3.6, still raises the exact same error.

I’m having the same basic issue as @gjedeer and see the same behavior as @cmbasnett (that wrapping in BytesIO is not a solution). I’m trying to use a file object to upload something larger than 2GB using a TLS encrypted post. Specifically I’m trying to use a presigned url to upload a file to S3. It appears that the underlying ssl library in python doesn’t like files over 2GB. Is there an accepted workaround to this? Stack trace:

Basic code:

 with open(self.path_to_data, 'rb') as f:
    fields = 'defined elsewhere...'
    files = {'file': f}
    request('post', url, data=fields, files=files)
Traceback (most recent call last):
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/bin/mds", line 11, in <module>
    load_entry_point('mdscli', 'console_scripts', 'mds')()
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/click/core.py", line 764, in __call__
    return self.main(*args, **kwargs)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/click/core.py", line 717, in main
    rv = self.invoke(ctx)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/click/core.py", line 1137, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/click/core.py", line 956, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/click/core.py", line 555, in invoke
    return callback(*args, **kwargs)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/click/decorators.py", line 64, in new_func
    return ctx.invoke(f, obj, *args, **kwargs)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/click/core.py", line 555, in invoke
    return callback(*args, **kwargs)
  File "/Users/coltonhicks/dev/mds_cli/mdscli/cli.py", line 133, in upload_member_data
    uploader.main()
  File "/Users/coltonhicks/dev/mds_cli/mdscli/upload_member_data.py", line 30, in main
    self.client.upload_member_data(self.mida, self.data_type, f)
  File "/Users/coltonhicks/dev/mds_cli/mdscli/requests_client.py", line 300, in upload_member_data
    logger.info(
  File "/Users/coltonhicks/dev/mds_cli/mdscli/requests_client.py", line 186, in _request
    res = requests.request(method, url, data=data, files=files)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/requests/api.py", line 60, in request
    return session.request(method=method, url=url, **kwargs)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/requests/sessions.py", line 533, in request
    resp = self.send(prep, **send_kwargs)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/requests/sessions.py", line 646, in send
    r = adapter.send(request, **kwargs)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/requests/adapters.py", line 449, in send
    timeout=timeout
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/urllib3/connectionpool.py", line 603, in urlopen
    chunked=chunked)
  File "/Users/coltonhicks/.virtualenvs/mds_cli-AtYG3_5U/lib/python3.7/site-packages/urllib3/connectionpool.py", line 355, in _make_request
    conn.request(method, url, **httplib_request_kw)
  File "/usr/local/Cellar/python/3.7.3/Frameworks/Python.framework/Versions/3.7/lib/python3.7/http/client.py", line 1229, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/usr/local/Cellar/python/3.7.3/Frameworks/Python.framework/Versions/3.7/lib/python3.7/http/client.py", line 1275, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/usr/local/Cellar/python/3.7.3/Frameworks/Python.framework/Versions/3.7/lib/python3.7/http/client.py", line 1224, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/usr/local/Cellar/python/3.7.3/Frameworks/Python.framework/Versions/3.7/lib/python3.7/http/client.py", line 1055, in _send_output
    self.send(chunk)
  File "/usr/local/Cellar/python/3.7.3/Frameworks/Python.framework/Versions/3.7/lib/python3.7/http/client.py", line 977, in send
    self.sock.sendall(data)
  File "/usr/local/Cellar/python/3.7.3/Frameworks/Python.framework/Versions/3.7/lib/python3.7/ssl.py", line 1015, in sendall
    v = self.send(byte_view[count:])
  File "/usr/local/Cellar/python/3.7.3/Frameworks/Python.framework/Versions/3.7/lib/python3.7/ssl.py", line 984, in send
    return self._sslobj.write(data)
OverflowError: string longer than 2147483647 bytes

Regarding the newer (since 2018) questions for a solution here:
The workaround using requests-toolbelt works for me in Python 3.8.

The requests-code to upload was the following:
requests.request(method, url, *args, **kwargs)
…with the kwargs being:
{'files': {'file': ({filename}', <_io.BufferedReader name='{filepath}'>), 'parent_dir': '/bert_files/bert_models'}, 'headers': {'Authorization': 'Token {token}'}}.
Wrapping it like this:

m = MultipartEncoder(
    fields={'file': (kwargs['files']['file'][1].name, open(kwargs['files']['file'][1].name, 'rb'), 'text/plain'),
            'parent_dir': kwargs['files']['parent_dir']}
)
del kwargs['files']
kwargs['data'] = m
kwargs['headers']['Content-Type'] =  m.content_type

…leading to these kwargs:
{'headers': {'Authorization': 'Token {token}', 'Content-Type': 'multipart/form-data; boundary={boundary}'}, 'data': <MultipartEncoder: {'file': ('filename', <_io.BufferedReader name={filepath}'>, 'text/plain'), 'parent_dir': '{parent_tir}'}>}

Like this it worked perfectly for me.

I need to upload all the files in a directory through the API, the largest file is about 20 gb, but I keep getting an OverflowError with files larger than 2GB. I’m using the example of this thread (https://www.dropboxforum.com/t5/API-Support-Feedback/python-upload-big-file-example/m-p/166626). But I can’t make it work.

This is the code:

def subebackups():
    dbx = dropbox.Dropbox('MY_APP_TOKEN',timeout=36000)
    rootdir = 'ROOT_DIR'
    file_size = os.path.getsize(rootdir)
    CHUNK_SIZE = 50 * 1024 * 1024
    for dir, dirs, files in os.walk(rootdir):
        for file in files:
            file_path = os.path.join(dir, file)
            dest_path =  os.path.join('/Backups/Backups', file).replace("\","/")
 
            with open(file_path, "rb") as f:
                if file_size <= CHUNK_SIZE:
                    dbx.files_upload(f.read(), dest_path)

                else:
                
                   upload_session_start_result = dbx.files_upload_session_start(f.read(CHUNK_SIZE))
                   cursor = dropbox.files.UploadSessionCursor(session_id=upload_session_start_result.session_id,
                                                              offset=f.tell())
                   commit = dropbox.files.CommitInfo(path=dest_path)

                   while f.tell() < file_size:
                       if ((file_size - f.tell()) <= CHUNK_SIZE):
                           print(dbx.files_upload_session_finish(f.read(CHUNK_SIZE),
                                                           cursor,
                                                           commit))
                       else:
                           dbx.files_upload_session_append(f.read(CHUNK_SIZE),
                                                           cursor.session_id,
                                                           cursor.offset)
                           cursor.offset = f.tell()

This is the ouput with files larger than 2 GB:

Traceback (most recent call last):
File «C:UsersOrdicoDownloadstools-mastertools-masterdropboxSubeBackupsDropbox.py», line 66, in <module>
subebackups()
File «C:UsersOrdicoDownloadstools-mastertools-masterdropboxSubeBackupsDropbox.py», line 40, in subebackups
dbx.files_upload(f.read(), dest_path)
File «C:envdropboxlibsite-packagesdropboxbase.py», line 2960, in files_upload
r = self.request(
File «C:envdropboxlibsite-packagesdropboxdropbox.py», line 311, in request
res = self.request_json_string_with_retry(host,
File «C:envdropboxlibsite-packagesdropboxdropbox.py», line 461, in request_json_string_with_retry
return self.request_json_string(host,
File «C:envdropboxlibsite-packagesdropboxdropbox.py», line 557, in request_json_string
r = self._session.post(url,
File «C:envdropboxlibsite-packagesrequestssessions.py», line 578, in post
return self.request(‘POST’, url, data=data, json=json, **kwargs)
File «C:envdropboxlibsite-packagesrequestssessions.py», line 530, in request
resp = self.send(prep, **send_kwargs)
File «C:envdropboxlibsite-packagesrequestssessions.py», line 643, in send
r = adapter.send(request, **kwargs)
File «C:envdropboxlibsite-packagesrequestsadapters.py», line 439, in send
resp = conn.urlopen(
File «C:envdropboxlibsite-packagesurllib3connectionpool.py», line 670, in urlopen
httplib_response = self._make_request(
File «C:envdropboxlibsite-packagesurllib3connectionpool.py», line 392, in _make_request
conn.request(method, url, **httplib_request_kw)
File «c:usersordicoappdatalocalprogramspythonpython38libhttpclient.py», line 1230, in request
self._send_request(method, url, body, headers, encode_chunked)
File «c:usersordicoappdatalocalprogramspythonpython38libhttpclient.py», line 1276, in _send_request
self.endheaders(body, encode_chunked=encode_chunked)
File «c:usersordicoappdatalocalprogramspythonpython38libhttpclient.py», line 1225, in endheaders
self._send_output(message_body, encode_chunked=encode_chunked)
File «c:usersordicoappdatalocalprogramspythonpython38libhttpclient.py», line 1043, in _send_output
self.send(chunk)
File «c:usersordicoappdatalocalprogramspythonpython38libhttpclient.py», line 965, in send
self.sock.sendall(data)
File «c:usersordicoappdatalocalprogramspythonpython38libssl.py», line 1204, in sendall
v = self.send(byte_view[count:])
File «c:usersordicoappdatalocalprogramspythonpython38libssl.py», line 1173, in send
return self._sslobj.write(data)
OverflowError: string longer than 2147483647 bytes

Hey all,

I’m trying to script out an overwrite workflow using the method found here:

*Relevant code from above link (Overwrite Feature Layer section near the bottom):

from arcgis.features import FeatureLayerCollection
cities_flayer_collection = FeatureLayerCollection.fromitem(cities_item)
#call the overwrite() method which can be accessed using the manager property
cities_flayer_collection.manager.overwrite(os.path.join('data', 'updating_gis_content', 'updated_capitals_csv', 'capitals_1.csv'))

I’ve used this in the past for other scripts and it works swell. However, I now have a need to create a single «master» feature layer comprised of multiple datasets that have been merged together, and it comes out as a pretty big file size when zipped together in a fgdb for upload (2.1 GB). When I manually execute the Overwrite using the menu buttons on the item detail page in ArcGIS Online, it uploads and overwrites the feature layer from my zipped fgdb with no problem.

When I try to replicate this via the scripting method above, I get the following error:

Traceback (most recent call last):
  File «C:/Users/user_user/.PyCharmCE2019.3/config/scratches/DRP_Update_SNAPP_Master.py», line 146, in <module>
    flayer.manager.overwrite(newdata)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libsite-packagesarcgisfeaturesmanagers.py», line 1268, in overwrite
    if related_data_item.update(item_properties=params, data=data_file):
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libsite-packagesarcgisgis__init__.py», line 8165, in update
    large_thumbnail)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libsite-packagesarcgis_implportalpy.py», line 2228, in update_item
    resp = self.con.post(path, postdata, files)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libsite-packagesarcgis_implconnection.py», line 1106, in post
    resp = opener.open(req)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3liburllibrequest.py», line 526, in open
    response = self._open(req, data)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3liburllibrequest.py», line 544, in _open
    ‘_open’, req)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3liburllibrequest.py», line 504, in _call_chain
    result = func(*args)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3liburllibrequest.py», line 1361, in https_open
    context=self._context, check_hostname=self._check_hostname)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3liburllibrequest.py», line 1318, in do_open
    encode_chunked=req.has_header(‘Transfer-encoding’))
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libhttpclient.py», line 1254, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libhttpclient.py», line 1300, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libhttpclient.py», line 1249, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libhttpclient.py», line 1075, in _send_output
    self.send(chunk)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libhttpclient.py», line 996, in send
    self.sock.sendall(data)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libssl.py», line 975, in sendall
    v = self.send(byte_view[count:])
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libssl.py», line 944, in send
    return self._sslobj.write(data)
  File «C:Program FilesArcGISProbinPythonenvsarcgispro-py3libssl.py», line 642, in write
    return self._sslobj.write(data)
OverflowError: string longer than 2147483647 bytes

I haven’t found any sort of note within the API reference stating that the overwrite uploads are limited to 2GB — and it works when I do it manually — so I’m confused what I’m running into here, and unsure how to work around it. Is it a limitation of the urllib or ssl packages that the .overwrite function uses? 

-Aaron

Понравилась статья? Поделить с друзьями:
  • Overflow adc ошибка онк 160
  • Overcooked 2 has encountered an error and must close
  • Overclocking may cause damage to your cpu and motherboard как исправить
  • Overclocking failed ошибка
  • Overclocking failed please enter setup to reconfigure your system как исправить