]>
git.mxchange.org Git - fba.git/log
Roland Häder [Sat, 20 Jul 2024 18:29:23 +0000 (20:29 +0200)]
Continued:
- don't reduce counter for obfuscated domains, keep it at original value
- some more debug messages added
Roland Häder [Thu, 18 Jul 2024 22:16:38 +0000 (00:16 +0200)]
Continued:
- split off question mark (mostly comes from WordPress instances)
Roland Häder [Wed, 17 Jul 2024 18:07:17 +0000 (20:07 +0200)]
Continued:
- skip unset software (None)
Roland Häder [Sat, 13 Jul 2024 20:52:35 +0000 (22:52 +0200)]
Continued:
- removed never started feature to check local files containing blocked
instances (and maybe reasons)
Roland Häder [Sat, 13 Jul 2024 20:49:32 +0000 (22:49 +0200)]
Continued:
- annotated more functions for caching
- initialized private _config "array" (dict)
Roland Häder [Sat, 13 Jul 2024 14:18:34 +0000 (16:18 +0200)]
Continued:
- skip invalid/unwanted domains (like .ip2/tld/onion ...)
- proper name logged
Roland Häder [Sat, 13 Jul 2024 13:12:30 +0000 (15:12 +0200)]
Continued:
- ops, forgot to add variable 'domain'
Roland Häder [Sat, 13 Jul 2024 09:21:44 +0000 (11:21 +0200)]
Continued:
- invalid domains should most be logged as warnings, not debug levels but
sometimes it is okay
- allow RFC 2782 compliant domains (with underscore)
Roland Häder [Sat, 13 Jul 2024 09:19:07 +0000 (11:19 +0200)]
Continued:
- skip invalid domains (warning)
- skip .arpa/i2p/onion/tld (I2P might be allowed)
Roland Häder [Fri, 12 Jul 2024 09:53:38 +0000 (11:53 +0200)]
Continued:
- ops, renamed to much
Roland Häder [Wed, 10 Jul 2024 20:54:21 +0000 (22:54 +0200)]
Continued:
- a warning was to much here
- also need to check on other invalid TLDs, like .arpa, .tld and (if disabled)
.onion/.i2p TLDs
Roland Häder [Wed, 10 Jul 2024 18:54:25 +0000 (20:54 +0200)]
Continued:
- allowed underscore in domains (RFC 2782)
- have to check domain validity before domain_helpers.encode_idna() is being
invoked
Roland Häder [Wed, 10 Jul 2024 16:34:05 +0000 (18:34 +0200)]
Continued:
- introduced function domain_helper.encode_idna() which has centralized IDNA
encoding
Roland Häder [Tue, 9 Jul 2024 23:09:45 +0000 (01:09 +0200)]
Continued:
- added 'wordpress' and 'oolong' as peer sources
- some APIs may return a dictionary (dict) for an instance entry
Roland Häder [Tue, 9 Jul 2024 22:34:12 +0000 (00:34 +0200)]
Continued:
- added "和歌" as alias for mastodon
Roland Häder [Tue, 9 Jul 2024 22:23:14 +0000 (00:23 +0200)]
Continued:
- added 'devtunnels.ms' to blacklist as this is only for testing/developing
Roland Häder [Mon, 8 Jul 2024 21:09:27 +0000 (23:09 +0200)]
Continued:
- removed redundant dashes
Roland Häder [Sun, 7 Jul 2024 21:18:20 +0000 (23:18 +0200)]
Continued:
- added 'kookie' as another alias for 'misskey'
Roland Häder [Sun, 7 Jul 2024 20:13:23 +0000 (22:13 +0200)]
Continued:
- moved args.single handling to proper level
- wrapped strings in Shell script
- explained all entries in .gitignore
Roland Häder [Mon, 1 Jul 2024 21:58:25 +0000 (23:58 +0200)]
Continued:
- stop question marks in domain.raise_on() function
- split off question mark
- added space for better readability
- fixed syntax error
Roland Häder [Mon, 1 Jul 2024 16:21:54 +0000 (18:21 +0200)]
Continued:
- WordPress is a peer list provider and can be utilized for more "peers" aka.
instances
- URL from response object can be different than requested, it needs to be
revalidated
- invalid redirect URLs are now logged with a level WARNING message
Roland Häder [Sun, 23 Jun 2024 11:28:52 +0000 (13:28 +0200)]
Continued:
- 'neodb' might appear in names such as 'testando neodb'
Roland Häder [Sat, 15 Jun 2024 13:11:40 +0000 (15:11 +0200)]
Continued:
- if the found digest isn't valid, then set it to None
- ignore (not skip) any invalid digests
Roland Häder [Fri, 14 Jun 2024 16:48:25 +0000 (18:48 +0200)]
Continued:
- fixed exception, need to check is_registered() here first
Roland Häder [Sun, 2 Jun 2024 14:53:24 +0000 (16:53 +0200)]
Continued:
- show run-mode of nodeinfo.sh
- fetch_instances can now (again) fetch new instances (not registered)
Roland Häder [Sat, 1 Jun 2024 19:21:39 +0000 (21:21 +0200)]
Continued:
- added another UTF-8 exception
Roland Häder [Sat, 25 May 2024 16:55:09 +0000 (18:55 +0200)]
Continued:
- added another exception
Roland Häder [Mon, 20 May 2024 19:23:31 +0000 (21:23 +0200)]
Continued:
- Hugo isn't distributed at all. I guess someone replaced a federated software
with Hugo at the same domain
Roland Häder [Mon, 20 May 2024 14:36:51 +0000 (16:36 +0200)]
Continued:
- added 'biblioreads' as peer list provider
Roland Häder [Mon, 20 May 2024 12:49:13 +0000 (14:49 +0200)]
Continued:
- variable `response_url` is the whole URL, maybe stripped off from its anchor
(#) part but it is NEVER a valid domain (ops)
Roland Häder [Mon, 20 May 2024 12:33:25 +0000 (14:33 +0200)]
Continued:
- some servers may return '<html something>' at the start (position 0),
- so if string.find() cannot find that string "<html", it will return -1 instead
of zero
Roland Häder [Sat, 11 May 2024 19:36:59 +0000 (21:36 +0200)]
Continued:
- dict element 'domain' will always be there, no double-redundant checks
Roland Häder [Sat, 11 May 2024 19:30:26 +0000 (21:30 +0200)]
Continued:
- fetch_blocks_from_about() does return a 2-dimensional array, not a flat one
Roland Häder [Fri, 10 May 2024 20:56:07 +0000 (22:56 +0200)]
Continued:
- network aka. "universe" 'snac' added as peer source
Roland Häder [Sat, 4 May 2024 00:05:29 +0000 (02:05 +0200)]
Continued:
- addedd alias 'cutiekey' for 'misskey'
Roland Häder [Thu, 2 May 2024 01:23:12 +0000 (03:23 +0200)]
Continued:
- serveo.net blacklisted: 'Expose local servers to the internet'
Roland Häder [Tue, 30 Apr 2024 13:12:54 +0000 (15:12 +0200)]
Continued:
- nodeinfo_url should never be None/NULL here
Roland Häder [Tue, 23 Apr 2024 23:46:57 +0000 (01:46 +0200)]
Continued:
- blocked 'documentation.on.seirdy.one'
- removed old-lost blocklists
- added another CSV file for seirdy.one
Roland Häder [Sun, 21 Apr 2024 17:49:16 +0000 (19:49 +0200)]
Continued:
- rewrite all "*mammuthus*" to "mammuthus"
Roland Häder [Fri, 19 Apr 2024 23:41:39 +0000 (01:41 +0200)]
Continued:
- recheck_obfuscation now supports deleting unwanted domains from both tables
'instances' and 'blocks'
Roland Häder [Fri, 19 Apr 2024 15:41:20 +0000 (17:41 +0200)]
Continued:
- moved some pre-sorting
Roland Häder [Thu, 18 Apr 2024 17:23:45 +0000 (19:23 +0200)]
Continued:
- added alias 'trashposs' for 'misskey'
Roland Häder [Wed, 17 Apr 2024 20:50:46 +0000 (22:50 +0200)]
Continued:
- added uncached blocks.get_reason()
- update block reason when current isn't set but updated reason is set
Roland Häder [Wed, 17 Apr 2024 19:38:13 +0000 (21:38 +0200)]
Continued:
- renamed local (overwritten) variable 'blocked' to 'cleared' so the array
index can be found again
Roland Häder [Wed, 17 Apr 2024 18:10:06 +0000 (20:10 +0200)]
Continued:
- need to check if variable 'blocked' is None or an empty string first
- log size (len()) not whole array
Roland Häder [Tue, 16 Apr 2024 22:26:14 +0000 (00:26 +0200)]
Continued:
- added some missing instances.is_recent(foo) invocations
- use last_instance_fetch (default) where ever federation.fetch_instances() is
been invoked
- wording fixed
Roland Häder [Tue, 16 Apr 2024 22:08:39 +0000 (00:08 +0200)]
Continued:
- added alias 'tanukey' for 'misskey'
Roland Häder [Tue, 16 Apr 2024 14:23:50 +0000 (16:23 +0200)]
Continued:
- blacklisted 'clients.your-server.de' and 2 others as this is a reverse-IP host
name, but instances should have proper domain names
Roland Häder [Sun, 14 Apr 2024 09:54:15 +0000 (11:54 +0200)]
Continued:
- fixed 'TypeError: not enough arguments for format string'
Roland Häder [Wed, 10 Apr 2024 21:53:04 +0000 (23:53 +0200)]
Continued:
- cache blacklist function invocations
Roland Häder [Wed, 10 Apr 2024 21:38:28 +0000 (23:38 +0200)]
Continued:
- added 'toki' for peer lists, you can find it here:
https://github.com/purifetchi/Toki
Roland Häder [Wed, 10 Apr 2024 09:56:11 +0000 (11:56 +0200)]
Continued:
- python3-aiofiles needs to be installed manually with Python 3.9
- Python 3.11 seem to install it automatically
Roland Häder [Tue, 9 Apr 2024 16:11:37 +0000 (18:11 +0200)]
Continued:
- added missing keyword (ops)
Roland Häder [Tue, 2 Apr 2024 01:39:32 +0000 (03:39 +0200)]
Continued:
- removed entries as the list has been deleted by oliphant + they provide
their own version already
Roland Häder [Mon, 1 Apr 2024 22:56:35 +0000 (00:56 +0200)]
Continued:
- FBA now requires Python 3.11
- updated documentation accordingly
- removed fastapi and uvicorn requirements, your distribution must now provide
it
- removed jinja2 and requests
- ignore entire venv/ directory, contains local paths
Roland Häder [Mon, 1 Apr 2024 03:12:17 +0000 (05:12 +0200)]
Continued:
- updated README file to reflect virtual environment for Python 3.11
- README file is a bit better
Roland Häder [Mon, 1 Apr 2024 03:01:19 +0000 (05:01 +0200)]
Continued:
- switched from global path to virtual-environment path
Roland Häder [Mon, 1 Apr 2024 02:49:23 +0000 (04:49 +0200)]
Continued:
- Python 3.11 needs a virtual environment
- so please add
source "${HOME}/fba/venv/bin/activate"
... to your ~/.bashrc
Roland Häder [Sun, 31 Mar 2024 21:05:47 +0000 (23:05 +0200)]
Continued:
- reduced duplicated code into a local variable 'response_url'
- need to recheck validity of 'response_url'
Roland Häder [Sun, 31 Mar 2024 08:00:43 +0000 (10:00 +0200)]
Continued:
- logged types (string or 'None' type)
Roland Häder [Sun, 31 Mar 2024 05:01:10 +0000 (07:01 +0200)]
Continued:
- handled ParserRejectedMarkup properly
Roland Häder [Sun, 31 Mar 2024 01:09:40 +0000 (03:09 +0200)]
Continued:
- added alias 'nexkey' for 'misskey'
Roland Häder [Wed, 20 Mar 2024 17:09:34 +0000 (18:09 +0100)]
Continued:
- added 'piefed' as possible source for block lists, still the network lacks
API for instance list retrival
Roland Häder [Sun, 17 Mar 2024 00:07:51 +0000 (01:07 +0100)]
Continued:
- that Wiki page was listing "evil instances" and now explains the view of the
owner of what FediBlock is
Roland Häder [Tue, 12 Mar 2024 12:50:39 +0000 (13:50 +0100)]
Continued:
- added 'hugo' as a source for more peers (only one so far)
Roland Häder [Tue, 12 Mar 2024 10:27:49 +0000 (11:27 +0100)]
Continued:
- added new aliases for pleroma
- moved pleroma aliases to "private" module variable
Roland Häder [Mon, 11 Mar 2024 21:28:00 +0000 (22:28 +0100)]
Continued:
- added aliases for misskey, mastodon and pleroma (each one)
- moved mastodon aliases to module ("private") variable, please don't access it
outside the module!
Roland Häder [Mon, 11 Mar 2024 09:18:12 +0000 (10:18 +0100)]
Continued:
- fixed SQL string
- partly reverted because simple code like 'tuple(row)' doesn't work
Roland Häder [Sun, 10 Mar 2024 10:57:12 +0000 (11:57 +0100)]
Continued:
- need to convert type Row to a true tuple
Roland Häder [Sun, 10 Mar 2024 10:53:17 +0000 (11:53 +0100)]
Continued:
- row[] does already contain named keys, e.g. 'total_websites' so let's alias
the COUNT() statement to such key name which reduces some code
- renamed 'blocks_recorded' to 'total_blocks'
Roland Häder [Sun, 10 Mar 2024 10:35:19 +0000 (11:35 +0100)]
Continued:
- renamed known_instances to total_websites because this is more clear
- you cannot distinguish between a regular website and a former Fediverse
instance (e.g. before: Mastodon was installed, now IBM Connections)
Roland Häder [Sat, 9 Mar 2024 20:16:10 +0000 (21:16 +0100)]
Continued:
- check 'json' key
Roland Häder [Fri, 8 Mar 2024 15:43:10 +0000 (16:43 +0100)]
Continued:
- added missing key 'json' (ops!)
Roland Häder [Sun, 3 Mar 2024 03:37:26 +0000 (04:37 +0100)]
Continued:
- added
7988276 .xyz as a testing/development hoster
Roland Häder [Sun, 3 Mar 2024 03:25:11 +0000 (04:25 +0100)]
Continued:
- added another alias "hijikey" for "misskey"
Roland Häder [Fri, 1 Mar 2024 06:36:44 +0000 (07:36 +0100)]
Continued:
- moved --same from nodeinfo.sh to command update_nodeinfo()
Roland Häder [Wed, 28 Feb 2024 17:12:22 +0000 (18:12 +0100)]
Continued:
- also log parameter 'column'
- blocks.add() should not be invoked with 'blocker' is already blocking
'blocked' at 'block_level'
Roland Häder [Sun, 25 Feb 2024 02:43:29 +0000 (03:43 +0100)]
Continued:
- added another gardenfence blocklist for bootstrapping FBA
- just run ./fba.py fetch_txt and you can build up an initial list of peers
Roland Häder [Thu, 22 Feb 2024 23:38:57 +0000 (00:38 +0100)]
Continued:
- more pre-checks to avoid exceptions
- don't abuse catching them here as a control statement (if)
Roland Häder [Thu, 15 Feb 2024 17:58:51 +0000 (18:58 +0100)]
Continued:
- added network type 'vebinet' (Mozilla's Fediverse software) for peer retrival
- domain blocks aren't supported (yet)
Roland Häder [Thu, 15 Feb 2024 13:30:07 +0000 (14:30 +0100)]
Continued:
- first simple variable checks
- next "lighter" functions being invoked
- last "heavy" function, which means possible database queries
Roland Häder [Tue, 13 Feb 2024 18:31:31 +0000 (19:31 +0100)]
Continued:
- added some more plain-text block lists (thanks to @Kromonos)
Roland Häder [Mon, 12 Feb 2024 15:45:46 +0000 (16:45 +0100)]
Continued:
- check parameters first, then "expensive" function invocation's returned value
- command fetch_instances should always fetch instances from a provided domain
name
- only when --software=bar is used and no --force parameter was given
Roland Häder [Thu, 8 Feb 2024 17:00:57 +0000 (18:00 +0100)]
Continued:
- make --domain parameter for command fetch_instances optional so --software
can be handled, too
- skip recently crawled domains in same loop
- args.<domain|software> can be both of type None, too
Roland Häder [Wed, 7 Feb 2024 19:26:24 +0000 (20:26 +0100)]
Continued:
- github isn't any fediverse instance
- https://github.io redirects to a bad URL
Roland Häder [Wed, 7 Feb 2024 07:55:13 +0000 (08:55 +0100)]
Continued:
- it was imported from previous code and seem not to work everywhere
- so let's remove the non-working part
Roland Häder [Sun, 4 Feb 2024 08:29:38 +0000 (09:29 +0100)]
Continued:
- added alias 'rosekey' for 'misskey'
- also 'smithereen' is a federating (at least peer list is provided) software
Roland Häder [Sat, 3 Feb 2024 19:39:39 +0000 (20:39 +0100)]
Continued:
- maybe there is no unsorted list or table at all
Roland Häder [Wed, 24 Jan 2024 17:14:14 +0000 (18:14 +0100)]
Continued:
- 'continue' if an exception is thrown because that instance can be "ignored"
Roland Häder [Wed, 24 Jan 2024 05:18:53 +0000 (06:18 +0100)]
Continued:
- handled unregistered instances
Roland Häder [Tue, 16 Jan 2024 19:18:16 +0000 (20:18 +0100)]
Continued/WIP:
- in commands.fetch_instances() added initialization of variables 'rows'
- also moved fetching rows into if() block
- commented some more code
Roland Häder [Tue, 16 Jan 2024 02:48:35 +0000 (03:48 +0100)]
Continued:
- added alias "kmyblue" for mastodon
Roland Häder [Sun, 14 Jan 2024 01:25:10 +0000 (02:25 +0100)]
WIP:
- rewritten fetch_instances towars allowing also --software=foo as an
alternative parameter
Roland Häder [Fri, 12 Jan 2024 09:00:40 +0000 (10:00 +0100)]
Continued:
- changed string "true/false" to boolean
- please update your configuration file (all occurances!)
Roland Häder [Fri, 12 Jan 2024 03:56:07 +0000 (04:56 +0100)]
Continued:
- need to negate this state: only skip/reject .i2p domains when they are not
allowed by configuration (which is default)
Roland Häder [Wed, 10 Jan 2024 21:52:50 +0000 (22:52 +0100)]
Continued:
- moved utils.fetch_url() to module network as this is network-related
- added some debug lines
Roland Häder [Tue, 2 Jan 2024 19:45:51 +0000 (20:45 +0100)]
Continued:
- need to remove prefix "re:" before cleaning up software, else all software is called "re"
- added alias "lovers" for "misskey"
Roland Häder [Tue, 2 Jan 2024 19:30:01 +0000 (20:30 +0100)]
Continued:
- added alias "miraiskey" for "misskey"
- moved all those aliases to "private" variable _misskey_aliases
Roland Häder [Sat, 23 Dec 2023 16:10:59 +0000 (17:10 +0100)]
Continued:
- fediverse.observer has changed their API to Graph (JSON POST)
- domain singleuser.club blacklisted, this domain started flooding with
sub-domains which have wwXX as another sub-domain
Roland Häder [Fri, 22 Dec 2023 07:44:52 +0000 (08:44 +0100)]
Continued:
- need to cut off everything after hash symbol because that is for JavaScript
click-event loaded content anyway
- prevented a few empty/None strings for invoking tidyup.domain()
- improved a few log messages