Concatenated data - Forum PHP.pl


Concatenated data

SN@JPER^	24.11.2017, 17:44:49 Post #1
Group: Registered Posts: 266 Helped: 0 Joined: 4.01.2007 From: Szczecin Warning: (0%)

I wrote the following class:

[PHP]
<?php
class Scrapper
{
    public $url;
    private $data;
    private $dataAfter;
    private $doc;
    private $xpath;
    private $ch;

    function __construct($url)
    {
        if (preg_match('/^http/', $url)) {
            libxml_use_internal_errors(true);
            $this->url = $url;
            $this->data = $this->curl($this->url);
            $this->doc = new \DOMDocument();
            $this->doc->loadHTML($this->data);
            $this->xpath = new DOMXPath($this->doc);
        }
    }

    public function queryTag($query)
    {
        if (!empty($query)) {
            $this->data = $this->xpath->query($query);
            return $this;
        }
    }

    public function getData($noHTML = false, $removeAttribute = false)
    {
        foreach ($this->data as $dataNodes) {
            if ($removeAttribute === true) {
                $dataNodes->removeAttribute('style');
                $dataNodes->removeAttribute('class');
                $dataNodes->removeAttribute('id');
            }
            if ($noHTML === true) {
                $this->dataAfter .= $dataNodes->nodeValue;
            } else {
                $this->dataAfter .= $dataNodes->ownerDocument->saveHTML($dataNodes);
            }
        }
        return $this->dataAfter;
    }

    private function curl($url)
    {
        if (!empty($url)) {
            $options = Array(
                CURLOPT_RETURNTRANSFER => TRUE, // Setting cURL's option to return the webpage data
                CURLOPT_FOLLOWLOCATION => TRUE, // Setting cURL to follow 'location' HTTP headers
                CURLOPT_AUTOREFERER => TRUE, // Automatically set the referer where following 'location' HTTP headers
                CURLOPT_CONNECTTIMEOUT => 120, // Setting the amount of time (in seconds) before the request times out
                CURLOPT_TIMEOUT => 120, // Setting the maximum amount of time for cURL to execute queries
                CURLOPT_MAXREDIRS => 10, // Setting the maximum number of redirections to follow
                CURLOPT_USERAGENT => "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1a2pre) Gecko/2008073000 Shredder/3.0a2pre ThunderBrowse/3.2.1.8", // Setting the useragent
                CURLOPT_URL => $this->url, // Setting cURL's URL option with the $url variable passed into the function
            );
            $this->ch = curl_init();
            curl_setopt_array($this->ch, $options);
            $this->data = curl_exec($this->ch);
            return $this->data;
        }
    }

    function __destruct()
    {
        curl_close($this->ch);
    }
}

$class = new \Scrapper('http://www.....');
$pic = $class->queryTag('//div[@id="left"]//img[@class="pic"]/@src')->getData();
$title = $class->queryTag('//div[@id="left"]//h2')->getData(true);
$text = $class->queryTag('//div[@id="left"]/p | //center')->getData(false, true);

echo $title;
echo '<hr>';
echo $pic;
echo '<hr>';
echo $text;
echo '<hr>';
[/PHP]

After instantiating this class, I assign each value I am looking for to its own variable: the image, the title, and the body text. Unfortunately, the title also contains the image URL string, and the text additionally contains the image and the title. Where is my mistake? How do I separate them? I would also appreciate suggestions on what I could improve in the class itself.

This post was edited by SN@JPER^ on 24.11.2017, 17:47:16

Posts in this topic

SN@JPER^ Concatenated data 24.11.2017, 17:44:49

trueblue Show a piece of the structure you are parsing. 24.11.2017, 18:59:11

Pyton_000 As far as I'm concerned, this class in itself deserves to be scrapped... 24.11.2017, 19:05:40

SN@JPER^ Quote (trueblue @ 24.11.2017, 18:59:11... 24.11.2017, 19:51:31

trueblue You keep appending data to dataAfter. 24.11.2017, 20:08:08

SN@JPER^ It works after I changed it to: [PHP] ... 24.11.2017, 20:15:40
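
trueblue's remark points at the cause: getData() appends to the dataAfter property with `.=` and never resets it, so each later call returns everything collected by the earlier calls as well. The poster's actual change is truncated in this listing, so the following is only a minimal sketch of one possible fix, not the code from the thread. The function name extractNodes and the sample markup are hypothetical; the idea is simply to build the result in a local variable that starts empty on every call.

```php
<?php
// Hypothetical helper (not from the thread): collects matched nodes into a
// local variable, so nothing carries over from one query to the next.
function extractNodes(DOMXPath $xpath, string $query, bool $noHTML = false): string
{
    $result = ''; // starts empty on every call, unlike a shared $this->dataAfter

    $nodes = $xpath->query($query);
    if ($nodes === false) {
        return $result; // malformed XPath expression
    }

    foreach ($nodes as $node) {
        $result .= $noHTML
            ? $node->nodeValue
            : $node->ownerDocument->saveHTML($node);
    }

    return $result;
}

// Made-up sample markup standing in for the page being scraped.
$html = '<div id="left"><img class="pic" src="a.jpg"><h2>Title</h2><p>Body</p></div>';

libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($html);
$xpath = new DOMXPath($doc);

// Each call returns only what its own query matched.
$pic   = extractNodes($xpath, '//div[@id="left"]//img[@class="pic"]/@src', true);
$title = extractNodes($xpath, '//div[@id="left"]//h2', true);
$text  = extractNodes($xpath, '//div[@id="left"]/p', true);
```

Inside the original class, the equivalent change would be to reset the accumulator at the top of getData(), or to drop the property and use a local variable as above. With this sample markup, $title holds only the heading text and $text only the paragraph text.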

