Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.


Messages - NetworkShark

Pages: [1] 2
1
Other Topics / Re: Hello from NetworkShark
« on: December 18, 2024, 10:02:38 pm »
Hy guys,

I believe it's impossible to retrieve the data in the traditional way.

The url to call is like this:
Code: [Select]
https://caching.graphql.imdb.com/? operationName=TitleReleaseDatesPaginated&variables={“after”: “NQ==”, “const”: “tt0147800”, “first”:50, “locale”: “en-US”, “originalTitleText”: false}&extensions={“persistedQuery”:{“sha256Hash”:“0e4e6468b8bc55114f80551e7a062301c78999ee538789a936902e4ab5239ccd”,“version”:1}}
The page has all the data except the sha256Hash, which seems to be a key for both authentication and data retrieval.

Sorry :'(

2
Other Topics / Re: Hello from NetworkShark
« on: December 15, 2024, 11:28:26 am »

Are you sure the AKA data is dynamic? I've dowloaded by linux machine and the data are here in the page.

I do some tests with the script 2c2c you have linked here and I return with some answers

I hadn't noticed the button "50 more", good morning to me!  ;D ;D

3
Other Topics / Re: Hello from NetworkShark
« on: December 15, 2024, 11:00:43 am »
NetworkShark I have a question for you.

Would it be possible to write a script to download all IMDb Also Known As (AKA) titles from, for example, this url

https://www.imdb.com/title/tt0147800/releaseinfo/?ref_=tt_ov_at_dt_rdat

Namely, IMDb uses dynamic display of Also Known As (AKA) titles and uses this method on some other IMDb websites.

I am adding the IMDB_[EN][HTTPS]_TEST_2c 2c script to help make it easier.

I hope that some solution will be found within PVD so that it can be used to transfer information to the pvd database.

Are you sure the AKA data is dynamic? I've dowloaded by linux machine and the data are here in the page.

I do some tests with the script 2c2c you have linked here and I return with some answers



Sorry that I'm in the conversation uninvited and not knowing programming.

No problem afrocuban, indeed I am interested in how I can integrate other ways to improve info retrieval from PVD

Maybe this part can be a separate thread in this forum


But I learned a bit along the way trying to locally download FA page with trailers. I succeeded to download it as downpage_trailer-UTF8_NO_BOM_FA.htm beside downpage-UTF8_NO_BOM.htm in order to try to parse them both, but no good news. I mean, I can parse them both, but dynamic content isn't downloaded,[/size]

I don't think we can do it with PVdBDownPage.exe:

Quote
// Download the URL page.
  LogMessage(Chr(9) + Chr(9) + '      Download with PVdBDownPage in file:|' + ScriptPath + OutFile + ' the information of:|' + URL + '||');
  FileExecute(ScriptPath + 'PVdBDownPage.exe', '"' + URL + '" "' + ScriptPath + OutFile + '"');

because it cannot download dynamic content since it
Quote
is likely due to the JavaScript that loads the video elements dynamically on the original page. When you download the page using DownloadPage, the JavaScript doesn’t execute, so the dynamic content (like embedded trailers) isn’t included in the downloaded HTML file.
To work around this, we have a couple of options:

Use a Web Browser Automation Tool: Tools like Selenium can control a web browser to visit the page and let all JavaScript execute before saving the fully rendered page’s HTML. This way, the dynamic content will be included.

Find the Video URL Manually: Check the original HTML of the page in your browser (right-click and “View Page Source” or inspect elements with Developer Tools). Look for the actual URLs of the video files. You can then use those URLs directly in your script.

Using a web browser automation tool like Selenium is a bit more involved and requires additional setup, but it is a more robust solution for handling dynamic content. If you prefer to go this route, I can guide you through the setup process.

Alternatively, manually inspecting the page’s source for video URLs is quicker but less convenient for repeated tasks.



Quote
Here's the rest of the conversation:
Invoking Selenium directly from within a PVD script (written in Pascal) is not straightforward because Selenium typically operates via Python, Java, or another high-level language. However, you can use a workaround by invoking an external Python script from your PVD Pascal script.

Here’s how you can integrate Selenium into your workflow:

Create a Python Script Using Selenium:

This script will navigate to the URL, wait for the content to load, and save the fully rendered HTML.

python
Quote
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

# Setup the WebDriver
driver = webdriver.Chrome(executable_path='path_to_chromedriver')

# Navigate to the URL
driver.get("https://www.filmaffinity.com/en/evideos.php?movie_id=xxxx")

# Wait for the trailers to load (adjust the waiting condition as needed)
WebDriverWait(driver, 10).until(
    EC.presence_of_element_located((By.CLASS_NAME, "divvid"))
)

# Get the fully rendered HTML
html = driver.page_source

# Save the HTML to a file
with open('downpage_trailer-UTF8_NO_BOM_FA.htm', 'w', encoding='utf-8') as file:
    file.write(html)

# Close the browser

from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

# Setup the WebDriver
driver = webdriver.Chrome(executable_path='path_to_chromedriver')

# Navigate to the URL
driver.get("https://www.filmaffinity.com/en/evideos.php?movie_id=xxxx")

# Wait for the trailers to load (adjust the waiting condition as needed)
WebDriverWait(driver, 10).until(
    EC.presence_of_element_located((By.CLASS_NAME, "divvid"))
)

# Get the fully rendered HTML
html = driver.page_source

# Save the HTML to a file
with open('downpage_trailer-UTF8_NO_BOM_FA.htm', 'w', encoding='utf-8') as file:
    file.write(html)

# Close the browser
driver.quit()

Invoke the Python Script from Your PVD Script:

Use the FileExecute method to call the Python script from within your Pascal script.

pascal
Quote
// Log the action
LogMessage(Chr(9) + Chr(9) + '      Download with Selenium in file:|' + ScriptPath + OutFile + ' the information of:|' + URL + '||');

// Construct the command to run the Python script
FileExecute('python.exe', ScriptPath + 'selenium_script.py');
In this setup:

The Python script handles the browsing and downloading of the fully rendered HTML using Selenium.

Your PVD script uses the FileExecute method to call the Python script, passing any necessary parameters if needed.

After the Python script finishes and the HTML is saved, your PVD script can continue processing the saved HTML as usual.

Example Complete Workflow
1. Python Script (selenium_script.py)

python
Quote
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
import sys

# Get the URL from the command line arguments
url = sys.argv[1]
output_file = sys.argv[2]

# Setup the WebDriver
driver = webdriver.Chrome(executable_path='path_to_chromedriver')

# Navigate to the URL
driver.get(url)

# Wait for the trailers to load (adjust the waiting condition as needed)
WebDriverWait(driver, 10).until(
    EC.presence_of_element_located((By.CLASS_NAME, "divvid"))
)

# Get the fully rendered HTML
html = driver.page_source

# Save the HTML to a file
with open(output_file, 'w', encoding='utf-8') as file:
    file.write(html)

# Close the browser
driver.quit()
2. Pascal Script (Invoke Selenium Script)

pascal
Quote
// Log the action
LogMessage(Chr(9) + Chr(9) + '      Download with Selenium in file:|' + ScriptPath + OutFile + ' the information of:|' + URL + '||');

// Construct the command to run the Python script
FileExecute('python.exe', '"' + ScriptPath + 'selenium_script.py" "' + URL + '" "' + ScriptPath + OutFile + '"');
By following this workflow, you can effectively integrate Selenium into your PVD script to handle pages with dynamic content. This way, you can ensure that all necessary content, like embedded trailers, is included in the HTML that your PVD script processes.

How would selenium know which url to open?
Great question. You can pass the URL to Selenium through command line arguments when you invoke the Python script. This way, your PVD script can dynamically supply the URL to Selenium. Here's how you can do it:

Step-by-Step Implementation
Modify Your Python Script to Accept Command Line Arguments:

You’ll adjust the Python script to accept the URL as an argument. This ensures that Selenium knows which URL to open.

Example Python Script (selenium_script.py):

python
Quote
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
import sys

# Get the URL and output file from the command line arguments
url = sys.argv[1]
output_file = sys.argv[2]

# Setup the WebDriver
driver = webdriver.Chrome(executable_path='path_to_chromedriver')

# Navigate to the URL
driver.get(url)

# Wait for the trailers to load (adjust the waiting condition as needed)
WebDriverWait(driver, 10).until(
    EC.presence_of_element_located((By.CLASS_NAME, "divvid"))
)

# Get the fully rendered HTML
html = driver.page_source

# Save the HTML to a file
with open(output_file, 'w', encoding='utf-8') as file:
    file.write(html)

# Close the browser
driver.quit()
Invoke the Python Script from Your PVD Script:

Use the FileExecute method to call the Python script, passing the URL and the output file path as arguments.

Example Pascal Script:

pascal
Quote
// Log the action
LogMessage(Chr(9) + Chr(9) + '      Download with Selenium in file:|' + ScriptPath + OutFile + ' the information of:|' + URL + '||');

// Construct the command to run the Python script
FileExecute('python.exe', '"' + ScriptPath + 'selenium_script.py" "' + URL + '" "' + ScriptPath + OutFile + '"');
How It Works
Python Script: The Python script (selenium_script.py) receives the URL and the output file path as command line arguments. It then uses Selenium to open the URL, waits for the trailers to load, and saves the fully rendered HTML to the specified output file.

Pascal Script: Your PVD script constructs the command to run the Python script with the necessary arguments (the URL and the output file path) and then executes it. This ensures that the correct URL is passed to Selenium dynamically.

This approach allows you to integrate Selenium seamlessly into your PVD script, enabling it to handle pages with dynamic content effectively. If you need further assistance with any part of the setup, feel free to ask!

How slow is this process?
The process of using Selenium to render and download the HTML can be slower than simply downloading a static page because Selenium has to:

Open a Web Browser: Launching a browser instance (e.g., Chrome or Firefox) takes a few seconds.

Navigate to the URL: Just like a human user, Selenium will navigate to the specified URL, which involves network latency.

Wait for JavaScript Execution: Selenium waits for the page to fully load, including the execution of any JavaScript that dynamically loads content. This can vary in time depending on the complexity of the page and the speed of your internet connection.

Save the HTML: Once the page is fully loaded, Selenium will extract and save the HTML content, which is relatively quick.

Estimated Time
Browser Launch: 1-3 seconds

Page Navigation: 1-5 seconds (depends on network speed)

JavaScript Execution & Content Load: 3-10 seconds (depends on page complexity)

Saving HTML: <1 second

In total, it can take anywhere from 5 to 20 seconds or more per page, depending on various factors.

Performance Considerations
Network Speed: Faster internet connections will reduce the navigation and loading time.

Page Complexity: Pages with heavy JavaScript and multiple dynamic elements will take longer to fully load.

Browser & System Performance: Using a faster browser and a system with more resources (CPU, RAM) will speed up the process.

While this is slower compared to a direct HTML download using PVdBDownPage.exe, it ensures that all dynamic content is fully loaded and accessible. This trade-off can be worthwhile if you need to scrape content that is loaded dynamically by JavaScript.

If speed becomes a critical issue, you might look into optimizing the Selenium script further, such as by reducing the wait times where possible or running multiple instances in parallel if applicable.


4
Scripts and Templates / Re: AdultDVDEmpire script
« on: December 07, 2024, 05:29:54 pm »
Hi guys,

I've released a new version of this scripts, there is little news (in fact almost none) about the data being entered but I have added the bypass for age verification and a function that checks at each startup if there are temporary files older than a certain time (I put 5 min) deletes them.

Plus of course to some bugfixes and other bugs...  ;D

Link to repo for download latest script:
https://raw.githubusercontent.com/NetworkShark/PersonalVideoDatabase_Scripts/refs/heads/main/AdultEmpire.com%5BEN%5D%5BHTTPS%5D.psf

Have Fun

5
Other Topics / Re: PVD Scripts in Notepad++
« on: November 24, 2024, 10:41:58 am »
I don't know how and where I got original PVD UserDefinedLanguage script for Notepad++, but now I tweaked it, together with Default Styler in Notepad++, so it's now easier for me to navigate through PVD scripts. Here are PVD UDL for .psf files, and tweaked default styler. You can import UDL Via Language->User Defined Language->Define Your Language->Import... and paste stylers.xml to appropriate folder for Notepad++ (The stylers.xml file for Notepad++ is typically located in the following directory ffor a normal installation: %AppData%\Notepad++\themes\stylers.xml and for a portable version: <installation_directory>\Data\Config\stylers.xml). You might want to backup your original stylers.xml, if you're not satisfied with the new Notepad++ look, and bring it back later. You can tweak the keys and colors of UDL to your likings at the place you imported it.


Choose the language PVD_UDL_final and choose Default(stylers.xml) in Settings->Style Configurator..., as in the picture 1, then restart Notepad++ and your PVD script should look like in the picture 2.

Great Job, finally I have Notepad++ which looks like an IDE for PVD!

6
Support / Re: List of custom fields
« on: November 20, 2024, 10:25:25 pm »
Here's the most comprehensive list of both original and custom fields through many active and obsolete scripts, I have managed to gather.


If you want me to add fields from any other script please do not post links to the script, upload script here, or I won't be able to add it to the list. That is how we will easier track which additional scripts are added.

Probably very interesting column is column D which shows if script field is active or inactive ("hidden") in the script, so you might want to change the script if you find some inactive field in active script for yourself, and you even didn't know it's there. Not every hidden field will import data when activated, be aware, so if you're not sure, better ask Ivek.

List of my custom films is in column H. Yours is different so change it carefully if you want to know which custom fields from scripts you don't have added to your PVD. For this filter either column B or G for "No Custom Field Added in PVD for This Field".


There are many formulas in the file in order to help automation of future adding, as well as for better visibility inside the file, so be careful with any customization especially because of array formulas and conditional formatting.


It would be nice someone to change exclusively in column B what types of PVD fields are those in column A, with "No Custom Field Added in PVD for This Field" in column B. Please do not change this in column G. I added column B intentionally for this change only.


If you have any question, I'll do my best to respond.

Great work, afrocuban!

7
Support / Re: Poster field
« on: November 16, 2024, 06:48:47 pm »
Hello again, Ivek. I'm further customizing your script. I've got that custom director's field, but I will not post the script until some point when I will be satisfied with the level of customization.


Now I'm working on a possibility to import more than one poster at once from the site (let's say, IMDb site). Do you by any chance know if database or PVD itself allows that, so with the proper script coding, multiple posters would be parsed and populated to poster field?

Hi afrocuban,
PVD can handle multiple images for the same type (poster, disk cover, etc..) but from scripts I have never been able to upload more than 1 per type.

8
Support / Re: Merging skins
« on: November 07, 2024, 03:00:41 pm »

My goal would be to add people photos (thumbnails) for people listed in Movie skin's movie record, similar like on the website(s). Is that possible, anyone has an idea?

Are you sure that in the people section have the images?
I did a quick check and I see the thubnails from the movie record

9
Talk / Re: Personal Video Database alternate Forum
« on: November 02, 2024, 09:41:38 am »
Personal Video Database alternate Forum
was offline for a while because php version was upgraded
SORRY I did not realized this for a long while
mia culpa
it is NOW online again in an upgraded version wich support the new php version

meriator

Personal Video Database alternate Forum it is NOW online again.

Can you unlock my account on this forum?
Yes, you can, if there is a registration and after I approve it and thus unlock your account on this forum.

Edited, the account is approved and the status is Administrator.

Thank you!

10
Talk / Re: Personal Video Database alternate Forum
« on: October 30, 2024, 11:14:53 am »
Personal Video Database alternate Forum
was offline for a while because php version was upgraded
SORRY I did not realized this for a long while
mia culpa
it is NOW online again in an upgraded version wich support the new php version

meriator

Personal Video Database alternate Forum it is NOW online again.

Can you unlock my account on this forum?

11
Support / Re: IMDb
« on: October 29, 2024, 06:26:02 pm »
IMDB_[EN][HTTPS]_Search -a script finds results on IMDb if results are searched for in the normal way as in the link below.

https://www.imdb.com/find/?s=tt&q=Beverly+Hills+Cop+Axel+F

The Imdb.com[EN][HTTPS]_Search script and the
IMDB_[EN][HTTPS]_Search -a script now return to finding results in the normal search results mode.

Imdb.com[EN][HTTPS]_Search script and
IMDB_[EN][HTTPS]_Search -a script is  attached.

IMDB_[EN][HTTPS]_Search -b script finds results on IMDb if results are searched in "Advanced Search" mode as in the link below.

https://www.imdb.com/search/title/?title=Beverly+Hills+Cop+Axel+F

The Imdb.com[EN][HTTPS]_Search_adv script and the
IMDB_[EN][HTTPS]_Search -b script now return to finding results in the "Advanced Search" results mode.

By the way:
The Imdb.com[EN][HTTPS]_Search_adv script is the same one that NetworkShark fixed last year.


Imdb.com[EN][HTTPS]_Search_adv script and
IMDB_[EN][HTTPS]_Search -b script is  attached.

Great Ivek, I was just now beginning to understand the problem  ;D

12
Support / Re: IMDb
« on: October 28, 2024, 07:11:27 pm »
Any help appreciated.

The source code for the url address has been changed a bit, I already know that. It may only be necessary to make a small correction to make IMDB_[EN][HTTPS]_Search -b script and IMDB_[EN][HTTPS]_Search -a script work again. I will have to wait a few days to fix this, because the days are shorter now and I have other more important things to do. I think that around November 1st it will be fixed and new versions of the script will be added.

Sorry, if I'm going to jump into this discussion, but how come there are various scripts?

13
Support / Re: Skins
« on: December 24, 2023, 09:55:04 am »
Here's my small contribution.


I used CG's Dark People skin and Dave C's gallery skin and combined them. I also rearranged some fields. Hopefully someone will find it useful.

Thanks a lot afrocuban

14
Support / Re: IMDb
« on: December 24, 2023, 09:53:54 am »
Thanks for help and fix bug.
I'm here to help! 8)



Thanks afrocuban, I made a bugfix to the script released a few posts earlier.
Now only if the urls are from IMDb are used for data retrieval.

Happy Holidays!


Thank you, NetworkShark and Ivek! Best wishes!


Just to let know NetworkShark, that his fix now works if there is one non-imdb link in it. But, if there are more than one links there (in my case, that would be AllMovie and FilmAffinity links), then it crashes PVD again.

Ivek's -b search script doesn't offer right result when multiple choices are, while -a search script offers the right movie among others.

Example:

http://www.imdb.com/title/tt0058724/


So, generally it looks -a search script is the best to use in order not to miss the result for the movie we are looking for.

Thank you afrocuban, you too!

I can't replicate the error, can you get the log?

Thanks,
Net

15
Support / Re: IMDb
« on: December 23, 2023, 09:39:46 am »
I got some diagnose. Please check what is happening if any url is stored in the url field and running IMDb search after that:


Code: [Select]
(12/22/2023 8:19:44 PM) Compiling script: Imdb.com[EN][HTTPS]_Search_adv.psf
(12/22/2023 8:19:44 PM) Script compiled successfully: Imdb.com[EN][HTTPS]_Search_adv.psf
[Hint] (501:2): Variable 'CURPOS' never used
[Hint] (501:2): Variable 'ENDPOS' never used
[Hint] (502:2): Variable 'TMP' never used
[Hint] (502:2): Variable 'SEARCHSTR' never used
[Hint] (499:10): Variable 'Result' never used
[Hint] (594:2): Variable 'DEBUG_POS1' never used
(12/22/2023 8:19:44 PM) Executing script binary
(12/22/2023 8:19:44 PM) Logging in...
(12/22/2023 8:19:44 PM) Called GetCodePage
(12/22/2023 8:19:44 PM) Called GetCurrentMode
(12/22/2023 8:19:44 PM) GetCurrentMode: Search
(12/22/2023 8:19:44 PM) Called GetDownloadURL - Mode: 0
(12/22/2023 8:19:44 PM)  * DownloadURL:
http://www.filmaffinity.com/en/film220449.html || * Mode: 1
(12/22/2023 8:19:44 PM) Called GetBaseURL
(12/22/2023 8:19:44 PM) Searching movie information for: Luxembourg, Luxembourg
(12/22/2023 8:19:44 PM) Called GetCurrentMode
(12/22/2023 8:19:44 PM) GetCurrentMode: Normal
(12/22/2023 8:19:44 PM) Called ParsePage - Normal Mode
(12/22/2023 8:19:44 PM) Called ParseMoviePage
(12/22/2023 8:19:44 PM) Called GetMovieTitle
(12/22/2023 8:19:44 PM) GetMovieTitle:
(12/22/2023 8:19:44 PM) GetMovieTitle =>
(12/22/2023 8:19:44 PM) Called GetMovieYear
(12/22/2023 8:19:44 PM) GetMovieYear =>
(12/22/2023 8:19:44 PM)  * Provider data info retreived Ok in 2023-12-22 20:19:44| (~Updated~)
(12/22/2023 8:19:44 PM) Finish

Thanks afrocuban, I made a bugfix to the script released a few posts earlier.
Now only if the urls are from IMDb are used for data retrieval.

Happy Holidays!

16
Other Topics / Re: At the end of 2023...
« on: December 21, 2023, 06:24:39 pm »
... PVD is revived once again due to Ivek's magical takes on AllMovie and IMDb scripts?   :)


I am following other catalogues and I can tell you that neither is updated as PVD. In general they all dies...

I agree, all of this is active thanks to Ivek!

... PVD is revived once again due to Ivek's magical takes on AllMovie and IMDb scripts?   :)


I am following other catalogues and I can tell you that neither is updated as PVD. In general they all dies...

Thank you very much. However, I try to do my best to keep the updates up to date, considering that both myself and my mother have health problems, and as a result, there is less time for updating scripts, or it takes longer to edit the update and then I make tests regarding the correct functioning of this updated script.

At the same time, all the best to all PVD users in the new year 2024.


Happy New Year to you, too!

17
Scripts and Templates / Re: Use BING as default search result
« on: December 14, 2023, 08:24:44 pm »
I always wondered why it has to be Bing, actually, but assumed it was some "axiom".

I think it was used because it was perhaps the best choice after google

18
Scripts and Templates / Re: Use BING as default search result
« on: December 13, 2023, 09:40:38 pm »
I think we would test others search engine like duckduckgo.com.

19
Other Topics / Re: Hello from NetworkShark
« on: November 10, 2023, 08:48:30 pm »
Thank you Ivek, thousand times!


I have recently decide anyway to use PVD in the future regardless of the scripts state. It's basic functionality to at least record file entries will stay unbroken

Thank you for your hope in this project!

I hope Nostra will be heard, and I hope this will happen soon.
This project has a very good chance to grow.

20
Talk / Re: Github & similar
« on: November 06, 2023, 08:48:38 am »
Well I have to give the opportunity to participate, but if it is as you say, then it will be just me and Ivek doing the scripts...  8)

I'd take that "just" as "just enough"! :)

oh yes  :'(

Pages: [1] 2