Commit Graph

335 Commits

Author SHA1 Message Date
Username
9ed3287e41 ci: add gitea workflow for syntax and memory leak checks
- Python 2/3 syntax validation
- Static analysis for memory leak patterns
- Unbounded collection detection
- Circular reference checks
2025-12-24 00:19:13 +01:00
Username
f2e6b8216b httpd: fix dashboard accuracy and add memory tracking
- Add from __future__ import division
- Change "Tests This Session" to "Tests (Cumulative)"
- Fix Tor status to show IDLE when available but 0% success
- Add memory leak detection: RSS growth, peak, GC stats
2025-12-24 00:19:05 +01:00
Username
de750a1312 proxywatchd: fix division, uptime, and scaler
- Add from __future__ import division for correct percentages
- Remove start_time restoration so uptime reflects current session
- Remove success_rate threshold for scaling (scale on queue depth only)
2025-12-24 00:17:47 +01:00
Username
738fccdabb connection_pool: fix python 2 integer division
Add from __future__ import division to fix success_rate
and latency calculations returning 0 due to integer division.
2025-12-24 00:17:40 +01:00
Username
f83733dd46 proxywatchd: mark confirmed-dead proxies as permanently dead
- Add DEAD_PROXY=-1 constant for permanently dead proxies
- Mark proxy dead when: failed >= max_fail*2, or max_fail with fatal error
- Fatal errors: refused, unreachable, auth (proxy definitely not working)
- Dead proxies excluded from testing (failed >= 0 query)
- Cleanup_stale also removes old dead proxies
- Dashboard shows separate dead vs failing counts
2025-12-23 18:03:01 +01:00
Username
1e2054bec5 dashboard: fix missing tor pool and db stats fields 2025-12-23 17:56:12 +01:00
Username
6076d93377 docs: update roadmap and todo with dashboard v2 features 2025-12-23 17:48:59 +01:00
Username
53f37510f3 dashboard: add system monitoring and enhanced stats
- prominent check type badge in header (SSL/judges/http/irc)
- system monitor bar: load, memory, disk, process RSS
- anonymity breakdown: elite/anonymous/transparent counts
- database health: size, recent activity, dead proxy count
- enhanced Tor pool stats: requests, success rate, latency
- SQLite ANALYZE/VACUUM functions for query optimization
- database statistics API functions
2025-12-23 17:47:12 +01:00
Username
20fc1b01fd add export.py for proxy list export
CLI tool for exporting working proxies:
- multiple formats: txt, json, csv, length-prefixed
- filters: protocol, country, anonymity, max latency
- sorting: latency, added time, success count
- configurable output limit

Also update .gitignore to exclude data/ directory
2025-12-23 17:34:51 +01:00
Username
eb1bba0e13 docs: update roadmap and task tracking
- README: update feature list
- ROADMAP: add completed features, update priorities
- TODO: mark completed tasks, add new items
- config.ini.sample: update example values
- http2: minor cleanup
2025-12-23 17:24:25 +01:00
Username
1c437d2246 httpd: modern dashboard with gevent WSGI
- dark theme with CSS custom properties
- responsive grid layout for stats cards
- sparkline charts for rate/success history
- gevent pywsgi server with BaseHTTPServer fallback
- new API endpoints: /api/stats, /api/judges, /api/protocols
- protocol breakdown, latency percentiles, geo distribution
- tor pool health, judge availability, engine stats
- auto-refresh with configurable interval
2025-12-23 17:23:58 +01:00
Username
c0dbba7b45 proxywatchd: gevent concurrency and comprehensive stats
Major features:
- gevent monkey-patching for cooperative concurrency
- Stats class with latency percentiles, histograms, geo tracking
- session state persistence across restarts
- JudgeStats with cooldown for blocked/rate-limited judges
- ThreadScaler for dynamic greenlet pool sizing
- SIGHUP handler for config hot-reload
- SSL/TLS tracking with MITM detection
- anonymity detection (transparent/anonymous/elite)

Bug fixes:
- rwip(): fix string comparison (b[0] == '0' not == 0)
- rwip(): fix strip logic (b[1:] not b[:1])
- use string.ascii_letters for Py2/3 compatibility
2025-12-23 17:23:45 +01:00
Username
e7478de79e scraper: add engine stats API for dashboard
- EngineTracker.get_stats() returns detailed per-engine metrics
- get_scraper_stats() module function for external access
- includes success counts, backoff status, availability
2025-12-23 17:23:28 +01:00
Username
68a34f2638 fetch: detect proxy protocol from source URL path
- detect_proto_from_path() infers socks4/socks5/http from URL
- extract_proxies() now returns (address, proto) tuples
- ppf.py updated to handle protocol-tagged proxies
- profiler signal handler for SIGTERM stats dump
2025-12-23 17:23:17 +01:00
Username
e0e330301a dbs: add session persistence and stats history
- session_state table for persisting runtime stats across restarts
- stats_history table for hourly snapshots (24h graphs)
- latency tracking with exponential moving average
- anonymity detection columns (transparent/anonymous/elite)
- curated PROXY_SOURCES list for seeding
- migration functions for existing databases
2025-12-23 17:23:04 +01:00
Username
71fb3800ee config: add gevent dependency and min_threads option
- requirements.txt: add gevent for cooperative concurrency
- config.py: add min_threads setting for thread scaling
2025-12-23 17:22:52 +01:00
Username
267035802a ppf: reset stale_count when content hash changes 2025-12-22 00:05:06 +01:00
Username
f382a4ab6a ppf: add content hash for duplicate proxy list detection 2025-12-22 00:03:12 +01:00
Username
6b5eb83bf4 fetch: add robust proxy string validation 2025-12-21 23:49:02 +01:00
Username
73192311f3 docs: update dockerfile and readme 2025-12-21 23:38:04 +01:00
Username
01d5dfd477 minor: cleanup comboparse and http2 2025-12-21 23:37:59 +01:00
Username
9e7c8d78b3 fetch: unify known proxies cache 2025-12-21 23:37:58 +01:00
Username
747e6dd7aa ppf: improve exception handling and logging 2025-12-21 23:37:57 +01:00
Username
901f2c1aee httpd: improve api error handling 2025-12-21 23:37:49 +01:00
Username
b88aa2a878 scraper: add multi-engine support and tracking 2025-12-21 23:37:48 +01:00
Username
0274b84af8 engines: improve search engine rate limiting 2025-12-21 23:37:48 +01:00
Username
00623f3a18 connection_pool: add health tracking and backoff 2025-12-21 23:37:39 +01:00
Username
77867d0b2d dbs: add latency columns and migration 2025-12-21 23:37:38 +01:00
Username
2e1d9b7d3f gitignore: add database and data file patterns 2025-12-21 23:37:38 +01:00
Username
e2ef1b7e36 docs: mark geolocation and ssl testing as completed 2025-12-21 23:37:23 +01:00
Username
95bafcacff proxywatchd: add startup logging, fix geolocation error handling 2025-12-21 23:37:19 +01:00
Username
b6c0a89afd add IP2Location for geolocation 2025-12-21 23:37:13 +01:00
Username
d48cc3f9eb config: fix defaults for database and checktype 2025-12-21 23:37:09 +01:00
Username
8718d33276 fix: use canonical schema from dbs.py in proxywatchd 2025-12-21 10:30:31 +01:00
Username
f4e242fc18 add systemd unit for rootless podman container 2025-12-21 10:23:27 +01:00
Username
55bc9a635e docs: add README and update ROADMAP
- README.md: installation, configuration, usage, deployment
- ROADMAP.md: mark completed items (pooling, scaling, latency, containers)
- priority matrix updated with completion status
2025-12-21 10:19:18 +01:00
Username
79475c2bff add latency tracking and dynamic thread scaling
- dbs.py: add avg_latency, latency_samples columns with migration
- dbs.py: update_proxy_latency() with exponential moving average
- proxywatchd.py: ThreadScaler class for dynamic thread count
- proxywatchd.py: calculate/record latency for successful proxies
- proxywatchd.py: _spawn_thread(), _remove_thread(), _adjust_threads()
- scaler reports status alongside periodic stats
2025-12-21 00:08:19 +01:00
Username
1e43f50aa6 style: normalize test file indentation 2025-12-20 23:19:22 +01:00
Username
e24f68500c style: normalize indentation and improve code style
- convert tabs to 4-space indentation
- add docstrings to modules and classes
- remove unused import (copy)
- use explicit object inheritance
- use 'while True' over 'while 1'
- use 'while args' over 'while len(args)'
- use '{}' over 'dict()'
- consistent string formatting
- Python 2/3 compatible Queue import
2025-12-20 23:18:45 +01:00
Username
d356cdf6ee docs: mark priority queue complete 2025-12-20 23:11:54 +01:00
Username
a694e441a4 proxywatchd: add priority queue for job scheduling 2025-12-20 23:11:49 +01:00
Username
c224c55afe docs: mark tor connection pooling complete 2025-12-20 23:02:30 +01:00
Username
af5e1ce4b0 proxywatchd: integrate tor connection pool 2025-12-20 23:02:26 +01:00
Username
bc945a33ff add tor connection pool with health monitoring 2025-12-20 23:02:21 +01:00
Username
ce79ef7d7f engines: consolidate extract_urls with base class method 2025-12-20 22:50:46 +01:00
Username
4780b6f095 fetch: consolidate extract_proxies into single implementation 2025-12-20 22:50:39 +01:00
Username
9588da92e7 scraper: remove dead InstanceTracker class 2025-12-20 22:50:34 +01:00
Username
3188d50707 docs: update TODO and ROADMAP with completed work 2025-12-20 22:28:57 +01:00
Username
bef12e6bcf searx.instances: update with active SearXNG instances 2025-12-20 22:28:52 +01:00
Username
f289057267 cleanup: minor fixes in comboparse and soup_parser 2025-12-20 22:28:47 +01:00