git.mxchange.org Git - fba.git/log
Roland Häder [Sun, 28 May 2023 15:52:55 +0000 (17:52 +0200)]
Continued:
- maybe str(e) helps here and there?
Roland Häder [Sun, 28 May 2023 14:34:57 +0000 (16:34 +0200)]
Continued:
- only a warning, 'activitypub-troll.cf' is far too rampant
- skip to the next entry if it is blacklisted or no 'host' was found
Roland Häder [Sun, 28 May 2023 13:19:41 +0000 (15:19 +0200)]
Continued:
- 100 rows should work! (the fail-safe check "fetched versus expected" will
kick in here)
- also read origin and pass over 'origin' during fetching instances
Roland Häder [Sun, 28 May 2023 12:06:53 +0000 (14:06 +0200)]
Continued:
- encapsulated into function add_peers()
- need to add "Content-Type: application/json" for API requests, thanks to Kromonos
- introduced 'api_headers' for JSON API requests
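A minimal sketch of what a shared header set for JSON API requests could look like. Only the name 'api_headers' and the "Content-Type: application/json" value come from the entry above; the helper build_json_request(), the use of urllib and the example URL are assumptions for illustration.

```python
import json
import urllib.request

# Shared headers for JSON API requests (name taken from the log entry).
api_headers = {
    "Content-Type": "application/json",
}

def build_json_request(url: str, payload: dict) -> urllib.request.Request:
    """Hypothetical helper: prepare (but do not send) a JSON POST request."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(url, data=body, headers=api_headers, method="POST")

# The request is only constructed here, nothing is sent over the network.
request = build_json_request("https://example.invalid/api/endpoint", {"limit": 10})
```

Keeping the headers in one place means every JSON request sends the same Content-Type without repeating it at each call site.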
Roland Häder [Sun, 28 May 2023 11:16:07 +0000 (13:16 +0200)]
Continued:
- more consistency in table column names and also corresponding variables
Roland Häder [Sun, 28 May 2023 11:12:14 +0000 (13:12 +0200)]
Continued:
- res.json() sometimes cannot be checked (?)
- also need to check each element, as it might be 'None' and cause an error
Roland Häder [Sun, 28 May 2023 10:15:19 +0000 (12:15 +0200)]
Continued:
- some misskey instances only allow 10 items per fetch
- also used more masked strings
Roland Häder [Sun, 28 May 2023 09:36:18 +0000 (11:36 +0200)]
Continued:
- also ngrok-free.app is a testing/development ground, no productive/live
instances will be found there
- please don't abuse their kind services for hosting a live instance!
- didn't log variable "instance", oops
Roland Häder [Sun, 28 May 2023 09:24:42 +0000 (11:24 +0200)]
Continued:
- also let users install their software on sub-paths ...
Roland Häder [Sun, 28 May 2023 09:19:13 +0000 (11:19 +0200)]
Continued:
- don't name your variables after packages, Python seems not to be strict
about checking data types when referencing
- also scan misskey instances for new instances (no filter applied)
Roland Häder [Sun, 28 May 2023 07:58:12 +0000 (09:58 +0200)]
Continued:
- these aren't supposed to be real URLs, although they COULD actually be reached
- these URLs are references, not crawlable URLs
- so some people overdo the SSL here a little, as http:// is just enough
for referencing a specification
Roland Häder [Sat, 27 May 2023 13:29:54 +0000 (15:29 +0200)]
Continued:
- introduced fba.is_instance_registered()
Roland Häder [Sat, 27 May 2023 09:59:06 +0000 (11:59 +0200)]
Continued:
- also check for version numbers that contain no dot
Roland Häder [Sat, 27 May 2023 08:33:18 +0000 (10:33 +0200)]
Continued:
- including the configured base_url didn't make sense, let's better state
clearly what you will get (recently blocked instances)
Roland Häder [Sat, 27 May 2023 08:07:00 +0000 (10:07 +0200)]
Continued:
- oops, I need to check the API response code again
- also variable 'json' isn't here, need to take 'res.json()' directly
Roland Häder [Sat, 27 May 2023 07:44:48 +0000 (09:44 +0200)]
Continued:
- better to split the semicolon off only temporarily, not from the input variable
Roland Häder [Sat, 27 May 2023 07:35:08 +0000 (09:35 +0200)]
Continued:
- added regex for hexadecimal "version numbers"
- removed some extras from (software) generator before checking it against a regex
Roland Häder [Fri, 26 May 2023 15:10:52 +0000 (17:10 +0200)]
Continued:
- one more character to remove, which cuts off the separator, e.g. '/'
- also don't raise exceptions here, a returned unmodified software name is just fine
Roland Häder [Fri, 26 May 2023 14:02:39 +0000 (16:02 +0200)]
Continued:
- also expose these RSS URLs for auto-discovery
Roland Häder [Fri, 26 May 2023 04:41:53 +0000 (06:41 +0200)]
Continued:
- also strip out " by " and " see " (self-advertisement)
- same with " version"
- some version numbers had uncommonly long patch levels, e.g. 8.0.0000
Roland Häder [Fri, 26 May 2023 04:31:49 +0000 (06:31 +0200)]
Continued:
- also try to remove 'powered by ' and other self-advertisement from software
type
Roland Häder [Thu, 25 May 2023 23:24:01 +0000 (01:24 +0200)]
Continued:
- old Friendica installations (I found one with 2019.03) may have the version
number in the software's name info, in the format YYYY.MM (and maybe later others)
Roland Häder [Thu, 25 May 2023 23:09:27 +0000 (01:09 +0200)]
Continued:
- an INFO message is okay, let the user know that the <meta name='generator'>
was found and taken as the software behind the instance/website
Roland Häder [Thu, 25 May 2023 22:02:20 +0000 (00:02 +0200)]
Continued:
- DEBUG is okay here
Roland Häder [Thu, 25 May 2023 21:34:57 +0000 (23:34 +0200)]
Continued:
- oops, need to include [domain] for nodeinfos array, too
- more flexible check if a domain has pending nodeinfos
Roland Häder [Thu, 25 May 2023 20:55:58 +0000 (22:55 +0200)]
Continued:
- return modified software name when no common separator was found
Roland Häder [Thu, 25 May 2023 20:52:56 +0000 (22:52 +0200)]
Continued:
- added 'GENERATOR' as "detection mode"
- explained NULL in 'detection_mode'
Roland Häder [Thu, 25 May 2023 20:07:18 +0000 (22:07 +0200)]
Continued:
- try to strip off version numbers from software name
- remove_version() will output a warning and return 'software' unmodified if it
fails to match version number against regex
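The behaviour described above (warn and return 'software' unmodified when no version matches) might look roughly like the following sketch. The regex, the split on the last space and the logger setup are assumptions; the log only names remove_version() and its fallback behaviour.

```python
import logging
import re

logger = logging.getLogger(__name__)

# Assumed pattern: dotted numeric versions such as 1.2, 4.1.2 or v0.9.
VERSION_RE = re.compile(r"^v?\d+(\.\d+)+$")

def remove_version(software: str) -> str:
    """Strip a trailing version number from a software name, if one is found."""
    name, separator, tail = software.rpartition(" ")
    if not separator:
        # Single word, nothing that could be a version suffix.
        return software
    if VERSION_RE.match(tail) is None:
        # Fail soft: warn and hand back the input unchanged instead of raising.
        logger.warning("Cannot match version number in software=%r", software)
        return software
    return name.strip()
```

Returning the input unchanged keeps the crawler running when a name does not fit the pattern; the warning still leaves a trace for adding more patterns later.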
Roland Häder [Thu, 25 May 2023 17:01:26 +0000 (19:01 +0200)]
Continued:
- encapsulated deleting of pending errors in remove_pending_error()
- introduced fetch_generator_from_path() and used it as last fall-back
Roland Häder [Wed, 24 May 2023 16:23:05 +0000 (18:23 +0200)]
Continued:
- "guessing" it didn't really work, let's take <meta name='generator'> as the
software type, some people even change that and have no /.well-known/ path
enabled. Then they cannot federate anymore, at least not by auto-discovery
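Reading the <meta name='generator'> tag can be sketched with the standard library alone. The class and function names below are hypothetical and not taken from the repository, which may well use a different HTML parser.

```python
from html.parser import HTMLParser
from typing import Optional

class GeneratorParser(HTMLParser):
    """Remembers the content of the first <meta name="generator"> tag."""

    def __init__(self) -> None:
        super().__init__()
        self.generator: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.generator is not None:
            return
        attributes = dict(attrs)
        # Attribute values keep their case, so compare case-insensitively.
        if (attributes.get("name") or "").lower() == "generator":
            self.generator = attributes.get("content")

def find_generator(html: str) -> Optional[str]:
    """Return the generator string of an HTML page, or None if it has none."""
    parser = GeneratorParser()
    parser.feed(html)
    return parser.generator
```

The returned string still carries version numbers and self-advertisement, so it needs cleaning before it can serve as the software type.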
Roland Häder [Wed, 24 May 2023 15:23:56 +0000 (17:23 +0200)]
Continued:
- after sorting by count, sort by software name
Roland Häder [Wed, 24 May 2023 14:46:55 +0000 (16:46 +0200)]
Continued:
- output fetched rows so it is clear it might take some time to finish
Roland Häder [Wed, 24 May 2023 13:11:47 +0000 (15:11 +0200)]
Continued:
- fixed debug message (oops)
- commented out noisy debug lines
Roland Häder [Wed, 24 May 2023 12:43:32 +0000 (14:43 +0200)]
Continued:
- tidyup() used for header.text
- moved translations table to header
- renamed variable
Roland Häder [Wed, 24 May 2023 12:20:20 +0000 (14:20 +0200)]
Continued:
- reformatted, I tried to use post_json_api() for bot postings
Roland Häder [Wed, 24 May 2023 12:16:35 +0000 (14:16 +0200)]
Continued:
- 'mode' must be used also for these elements
Roland Häder [Wed, 24 May 2023 10:34:43 +0000 (12:34 +0200)]
Continued:
- also including 'follower', not only 'following'
Roland Häder [Wed, 24 May 2023 10:21:54 +0000 (12:21 +0200)]
Continued:
- PeerTube has discoverable instances lists, so let us browse them
Roland Häder [Wed, 24 May 2023 06:40:12 +0000 (08:40 +0200)]
Continued:
- no shortening of packages
Roland Häder [Wed, 24 May 2023 05:23:50 +0000 (07:23 +0200)]
Continued:
- if fetching nodeinfo was successful, then clear any status codes and error
messages for this instance
- also tidyup() found instance, some are invalid :-(
Roland Häder [Tue, 23 May 2023 23:13:36 +0000 (01:13 +0200)]
Continued:
- oops, my mistake: removed an array element while it was still needed
Roland Häder [Tue, 23 May 2023 19:08:39 +0000 (21:08 +0200)]
Continued:
- use validators.domain() for checking if it is a valid domain (and host name)
Roland Häder [Tue, 23 May 2023 18:02:26 +0000 (20:02 +0200)]
Continued:
- lemmy has another API URL and also different JSON output for getting more peers
- Unfinished: Fetching blocks from Lemmy
Roland Häder [Tue, 23 May 2023 11:56:05 +0000 (13:56 +0200)]
Continued:
- if the software being used isn't recognized, skip the instance
Roland Häder [Tue, 23 May 2023 10:04:08 +0000 (12:04 +0200)]
Continued:
- also check if 'blocker' is meanwhile blocked (WIP)
Roland Häder [Tue, 23 May 2023 09:57:41 +0000 (11:57 +0200)]
Continued:
- these are system-specific service files
Roland Häder [Tue, 23 May 2023 08:48:59 +0000 (10:48 +0200)]
Continued:
- added more mastodon-typical fields
Roland Häder [Tue, 23 May 2023 07:10:42 +0000 (09:10 +0200)]
Continued:
- added get_peers_url column
- flush any pending data after get_peers() was invoked
- already added instances that are blacklisted should be removed (WIP)
Roland Häder [Tue, 23 May 2023 07:01:41 +0000 (09:01 +0200)]
Continued:
- flexible handling of SQL columns and values
Roland Häder [Tue, 23 May 2023 06:18:51 +0000 (08:18 +0200)]
Continued:
- added detection_mode and nodeinfo_url to table 'instances'
- described arrays
Roland Häder [Tue, 23 May 2023 04:43:17 +0000 (06:43 +0200)]
Continued:
- found a Japanese "limited server" phrase
Roland Häder [Tue, 23 May 2023 04:08:42 +0000 (06:08 +0200)]
Continued:
- also 'meisskey' is an alias of misskey software
Roland Häder [Tue, 23 May 2023 04:05:26 +0000 (06:05 +0200)]
Continued:
- proper URLs with base_url
- all JSON replies now start with /api/
Roland Häder [Mon, 22 May 2023 15:08:57 +0000 (17:08 +0200)]
Continued:
- reduce amount of rows to one (performance)
Roland Häder [Mon, 22 May 2023 09:23:57 +0000 (11:23 +0200)]
Continued:
- log entire exception message
- show number of erroneous instances
Roland Häder [Mon, 22 May 2023 09:02:07 +0000 (11:02 +0200)]
Continued:
- try to update last error details and with a fake status_code
Roland Häder [Mon, 22 May 2023 05:05:15 +0000 (07:05 +0200)]
Continued:
- proper place
- also log status_code
Roland Häder [Mon, 22 May 2023 05:02:54 +0000 (07:02 +0200)]
Continued:
- older versions are okay, too!
Roland Häder [Mon, 22 May 2023 04:59:42 +0000 (06:59 +0200)]
Continued:
- both aren't fully supported, only peer discovery works
Roland Häder [Mon, 22 May 2023 04:52:05 +0000 (06:52 +0200)]
Continued:
- Takahe added (partly WIP)
Roland Häder [Mon, 22 May 2023 04:18:01 +0000 (06:18 +0200)]
Continued:
- strip out pipes from software names, too
Roland Häder [Mon, 22 May 2023 04:02:53 +0000 (06:02 +0200)]
Continued:
- added other node rel
Roland Häder [Mon, 22 May 2023 03:28:19 +0000 (05:28 +0200)]
Continued:
- also log these errors
Roland Häder [Mon, 22 May 2023 02:58:41 +0000 (04:58 +0200)]
Continued:
- first try auto-discovery, then static list
Roland Häder [Mon, 22 May 2023 02:52:13 +0000 (04:52 +0200)]
Continued:
- fetching .well-known/nodeinfo for auto-discovery added
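As a rough illustration of the auto-discovery step: a /.well-known/nodeinfo document lists links whose 'rel' is a schema identifier and whose 'href' is the real nodeinfo URL. The function name and the preference order below are assumptions; fetching the document itself is left out.

```python
import json
from typing import Optional

# Schema identifiers are plain references, not URLs that get crawled.
SCHEMA_PREFIX = "http://nodeinfo.diaspora.software/ns/schema/"
PREFERRED_VERSIONS = ["2.1", "2.0", "1.1", "1.0"]  # assumed order, newest first

def pick_nodeinfo_url(well_known_body: str) -> Optional[str]:
    """Pick the nodeinfo URL with the newest known schema version, if any."""
    document = json.loads(well_known_body)
    if not isinstance(document, dict):
        return None
    by_rel = {}
    for link in document.get("links") or []:
        # Individual entries may be malformed or None, so check each one.
        if isinstance(link, dict) and link.get("rel") and link.get("href"):
            by_rel[link["rel"]] = link["href"]
    for version in PREFERRED_VERSIONS:
        href = by_rel.get(SCHEMA_PREFIX + version)
        if href is not None:
            return href
    return None
```

When this returns None, a crawler can still fall back to a static list of well-known nodeinfo paths.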
Roland Häder [Mon, 22 May 2023 02:13:27 +0000 (04:13 +0200)]
Continued:
- also include bookwyrm in instance discovery (blocks is WIP)
Roland Häder [Mon, 22 May 2023 02:11:48 +0000 (04:11 +0200)]
Continued:
- added support for very old Mastodon instances, e.g. 2.x has no "software" in nodeinfo
Roland Häder [Mon, 22 May 2023 01:35:28 +0000 (03:35 +0200)]
Continued:
- fixed bad handling of errors during nodeinfo discovery
Roland Häder [Mon, 22 May 2023 01:00:24 +0000 (03:00 +0200)]
Continued:
- sorted nodeinfo API URLs by version
- added 1.0
Roland Häder [Mon, 22 May 2023 00:42:07 +0000 (02:42 +0200)]
Continued:
- renamed variable
- used more tidyup()
Roland Häder [Mon, 22 May 2023 00:11:50 +0000 (02:11 +0200)]
Continued:
- try to strip off path if not aliased and known
- fixed some errors
Roland Häder [Sun, 21 May 2023 23:57:02 +0000 (01:57 +0200)]
Continued:
- nicer look
- tidy up software, too
Roland Häder [Sun, 21 May 2023 17:36:42 +0000 (19:36 +0200)]
Continued:
- no local variable needed
Roland Häder [Sun, 21 May 2023 16:03:23 +0000 (18:03 +0200)]
Continued:
- keep connection and read timeout separate
Roland Häder [Sun, 21 May 2023 15:18:38 +0000 (17:18 +0200)]
Continued:
- don't shorten variable names, wrong laziness
Roland Häder [Sun, 21 May 2023 14:59:12 +0000 (16:59 +0200)]
Continued:
- res.reason is just a text representation of the code
Roland Häder [Sun, 21 May 2023 14:31:43 +0000 (16:31 +0200)]
Continued:
- also log last detailed error message
- introduced fba.is_blacklisted()
Roland Häder [Sun, 21 May 2023 09:56:32 +0000 (11:56 +0200)]
Continued:
- make all software names lower-case
- also include initial instance
Roland Häder [Sun, 21 May 2023 09:38:41 +0000 (11:38 +0200)]
Continued:
- added top X for used software
Roland Häder [Sun, 21 May 2023 09:27:16 +0000 (11:27 +0200)]
Continued:
- rewrote a bit
Roland Häder [Sun, 21 May 2023 07:51:48 +0000 (09:51 +0200)]
Continued:
- added top-referencing instances scoreboard
Roland Häder [Sun, 21 May 2023 07:36:08 +0000 (09:36 +0200)]
Continued:
- added first_seen and last_updated to instances table
- moved timestamp creation to functions (fewer parameters)
Roland Häder [Sun, 21 May 2023 05:16:23 +0000 (07:16 +0200)]
Continued:
- reformatted config.defaults.json
- also store script name
Roland Häder [Sat, 20 May 2023 14:30:40 +0000 (16:30 +0200)]
Continued:
- it is more the originator ...
Roland Häder [Sat, 20 May 2023 13:07:40 +0000 (15:07 +0200)]
Continued:
- added origin to track which instance has referenced this one
Roland Häder [Sat, 20 May 2023 12:36:59 +0000 (14:36 +0200)]
Continued:
- looks cooler this way
Roland Häder [Sat, 20 May 2023 12:26:02 +0000 (14:26 +0200)]
Continued:
- only fetch these, others are WIP
Roland Häder [Sat, 20 May 2023 11:31:29 +0000 (13:31 +0200)]
Continued:
- COUNT() is a SQL function, also always upper-case
Roland Häder [Sat, 20 May 2023 11:12:05 +0000 (13:12 +0200)]
Continued:
- added last_blocked to avoid re-querying instances too often (be nice!)
Roland Häder [Sat, 20 May 2023 11:00:09 +0000 (13:00 +0200)]
Continued:
- try block must be around software type checking, not the reqto.foo() call
Roland Häder [Sat, 20 May 2023 10:32:57 +0000 (12:32 +0200)]
Continued:
- some improvements
Roland Häder [Sat, 20 May 2023 08:50:32 +0000 (10:50 +0200)]
Continued:
- always update last_nodeinfo
Roland Häder [Sat, 20 May 2023 08:45:53 +0000 (10:45 +0200)]
Continued:
- renamed last_access to last_nodeinfo as fetch_blocks will have its own
last access timestamp
Roland Häder [Sat, 20 May 2023 08:33:34 +0000 (10:33 +0200)]
Continued:
- be kind! set recheck_instance to at least 3600 seconds (one hour)
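A minimal sketch of how such a lower bound could be enforced. Only 'recheck_instance' and the 3600 seconds come from the entry above; the function name, its parameters and the timestamp handling are assumptions.

```python
import time
from typing import Optional

MIN_RECHECK_SECONDS = 3600  # one hour, the minimum named in the log entry

def needs_recheck(last_checked: Optional[float], recheck_instance: int,
                  now: Optional[float] = None) -> bool:
    """Decide whether an instance may be queried again."""
    if last_checked is None:
        # Never checked before, so go ahead.
        return True
    if now is None:
        now = time.time()
    # Never allow a configured interval below the minimum.
    interval = max(recheck_instance, MIN_RECHECK_SECONDS)
    return now - last_checked >= interval
```

Clamping the configured value means a too-small setting cannot make the crawler hammer remote instances.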
Roland Häder [Sat, 20 May 2023 08:26:29 +0000 (10:26 +0200)]
WIP:
- introduced update_last_access() which will update 'last_access' by instance's
domain
Roland Häder [Sat, 20 May 2023 08:14:56 +0000 (10:14 +0200)]
Continued:
- rewrote fetch_nodeinfo() to test all URLs
Roland Häder [Sat, 20 May 2023 07:39:57 +0000 (09:39 +0200)]
Continued:
- introduced fba.fetch_nodeinfo()
Roland Häder [Sat, 20 May 2023 07:04:54 +0000 (09:04 +0200)]
Continued:
- some instances reported others with '@' in it which isn't correct for a
domain name/host IP
Roland Häder [Sat, 20 May 2023 06:44:44 +0000 (08:44 +0200)]
Continued:
- also show domain and software (mostly 'None')
Roland Häder [Sat, 20 May 2023 06:19:22 +0000 (08:19 +0200)]
WIP:
- blocked spam domain
- also discover more instances by querying already existing instances