一级视频在线观看,成年人视频免费看,啪啪在线视频

本文介紹了從 FTP python 讀取緩沖區(qū)中的文件的處理方法，對(duì)大家解決問題具有一定的參考價(jià)值，需要的朋友們下面隨著小編來一起學(xué)習(xí)吧！

問題描述

我正在嘗試從 FTP 服務(wù)器讀取文件.該文件是一個(gè) .gz 文件.我想知道我是否可以在套接字打開時(shí)對(duì)此文件執(zhí)行操作.我試圖遵循讀取文件而不寫入磁盤和從 FTP 讀取文件而不下載但不成功.

I am trying to read a file from an FTP server. The file is a .gz file. I would like to know if I can perform actions on this file while the socket is open. I tried to follow what was mentioned in two StackOverflow questions on reading files without writing to disk and reading files from FTP without downloading but was not successful.

我知道如何在下載的文件上提取數(shù)據(jù)/工作，但我不確定我是否可以即時(shí)完成.有沒有辦法連接到站點(diǎn)，在緩沖區(qū)中獲取數(shù)據(jù)，可能進(jìn)行一些數(shù)據(jù)提取并退出?

I know how to extract data/work on the downloaded file but I'm not sure if I can do it on the fly. Is there a way to connect to the site, get data in a buffer, possibly do some data extraction and exit?

嘗試 StringIO 時(shí)出現(xiàn)錯(cuò)誤:

When trying StringIO I got the error:

>>> from ftplib import FTP
>>> from StringIO import StringIO
>>> ftp = FTP('ftp://ftp.ncbi.nlm.nih.gov/pub/pmc/PMC-ids.csv.gz')

Traceback (most recent call last):
File "<pyshell#2>", line 1, in <module>
ftp = FTP('ftp://ftp.ncbi.nlm.nih.gov/pub/pmc/PMC-ids.csv.gz')
File "C:Python27libftplib.py", line 117, in __init__
self.connect(host)
File "C:Python27libftplib.py", line 132, in connect
self.sock = socket.create_connection((self.host, self.port), self.timeout)
File "C:Python27libsocket.py", line 553, in create_connection
for res in getaddrinfo(host, port, 0, SOCK_STREAM):
gaierror: [Errno 11004] getaddrinfo failed

我只需要知道如何將數(shù)據(jù)放入某個(gè)變量并在其上循環(huán)，直到讀取來自 FTP 的文件.

I just need to know how can I get data into some variable and loop on it until the file from FTP is read.

感謝您的寶貴時(shí)間和幫助.謝謝！

I appreciate your time and help. Thanks!

推薦答案

請(qǐng)務(wù)必先登錄ftp服務(wù)器.之后，使用 retrbinary 以二進(jìn)制模式拉取文件.它對(duì)文件的每個(gè)塊使用回調(diào).您可以使用它來將其加載到字符串中.

Make sure to login to the ftp server first. After this, use retrbinary which pulls the file in binary mode. It uses a callback on each chunk of the file. You can use this to load it into a string.

from ftplib import FTP
ftp = FTP('ftp.ncbi.nlm.nih.gov')
ftp.login() # Username: anonymous password: anonymous@

# Setup a cheap way to catch the data (could use StringIO too)
data = []
def handle_binary(more_data):
    data.append(more_data)

resp = ftp.retrbinary("RETR pub/pmc/PMC-ids.csv.gz", callback=handle_binary)
data = "".join(data)

加分項(xiàng):我們?cè)诮鈮鹤址畷r(shí)如何?

Bonus points: how about we decompress the string while we're at it?

簡(jiǎn)單模式，使用上面的數(shù)據(jù)字符串

Easy mode, using data string above

import gzip
import StringIO
zippy = gzip.GzipFile(fileobj=StringIO.StringIO(data))
uncompressed_data = zippy.read()

稍微好一點(diǎn)，完整的解決方案:

from ftplib import FTP
import gzip
import StringIO

ftp = FTP('ftp.ncbi.nlm.nih.gov')
ftp.login() # Username: anonymous password: anonymous@

sio = StringIO.StringIO()
def handle_binary(more_data):
    sio.write(more_data)

resp = ftp.retrbinary("RETR pub/pmc/PMC-ids.csv.gz", callback=handle_binary)
sio.seek(0) # Go back to the start
zippy = gzip.GzipFile(fileobj=sio)

uncompressed = zippy.read()

實(shí)際上，動(dòng)態(tài)解壓縮會(huì)好得多，但我看不到使用內(nèi)置庫的方法(至少不容易).

In reality, it would be much better to decompress on the fly but I don't see a way to do that with the built in libraries (at least not easily).

這篇關(guān)于從 FTP python 讀取緩沖區(qū)中的文件的文章就介紹到這了，希望我們推薦的答案對(duì)大家有所幫助，也希望大家多多支持html5模板網(wǎng)！

【網(wǎng)站聲明】本站部分內(nèi)容來源于互聯(lián)網(wǎng),旨在幫助大家更快的解決問題，如果有圖片或者內(nèi)容侵犯了您的權(quán)益，請(qǐng)聯(lián)系我們刪除處理，感謝您的支持！

pbootcms网站模板|日韩1区2区|织梦模板||网站源码|日韩1区2区|jquery建站特效-html5模板网

從 FTP python 讀取緩沖區(qū)中的文件

問題描述

推薦答案

相關(guān)文檔推薦